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Abstract 


Closing  the  Loop:  Control  and  Robot  Navigation  in  Wireless  Sensor  Networks 

by 

Shawn  Michael  Schaffert 

Doctor  of  Philosophy  in  Electrical  Engineering  and  Computer  Science 
University  of  California,  Berkeley 
Professor  Shankar  S.  Sastry,  Chair 

Wireless  sensor  networks  have  received  considerable  attention  for  their  potential  as  a  cheap,  easily 
deployed,  distributed  monitoring  tool.  Recently,  researchers  have  begun  to  investigate  the  use  of 
wireless  sensor  networks  to  drive  closed-loop  control  systems.  However,  such  composite  systems 
are  nontrivial  to  design  due  to  the  system  interface  dichotomy:  control  systems  typically  assume 
periodic,  high  frequency  sensor  updates  whereas  sensor  networks  provide  aperiodic,  low  frequency, 
and  laggy  sensor  updates.  Utilizing  robot  navigation  and  pursuit-evasion  games  as  benchmarks,  our 
research  focuses  on  improving  control  system  performance  by  exploiting  the  properties  of  wireless 
sensor  networks. 

We  developed  and  deployed  a  real-world,  medium-scale  wireless  sensor  network  for  play¬ 
ing  pursuit-evasion  games.  Using  our  experience  from  this  deployment,  we  highlight  the  difficulties 
in  using  sensor  network  data  to  accurately  localize  robots.  Several  techniques  designed  to  com¬ 
pensate  for  such  difficulties  are  developed  and  incorporated  into  an  unified  system  architecture. 
To  test  our  architecture,  an  application-level  simulator,  accounting  for  many  of  the  sensor  network 
characteristics  that  frustrate  control  design,  is  developed.  This  simulator  allows  us  to  identify  com¬ 
ponents  of  our  system  architecture  that  can  improve  the  performance  of  control  systems  operating 
in  networks  of  sensors.  Amongst  the  components,  intelligent  path  planning  is  identified  as  uniquely 
important  in  improving  robot  localization  accuracy  during  navigation. 

Path  planning  techniques  that  use  information  maps,  exploiting  the  knowledge  of  node 
topology  and  sensor  models,  are  developed.  Information  is  a  metric  for  measuring  the  ability  of 
a  region  in  the  environment  to  aid  in  robot  localization.  In  particular,  for  each  region  in  the  en¬ 
vironment,  an  information  map  computes  the  change  in  entropy  expected  by  a  robot  in  this  area 
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using  Markov  localization.  We  adapt  sensor  network  models  for  use  with  information  maps  and 
verify  the  ability  of  such  maps  to  improve  robot  localization.  Additionally,  automatic  path  planning 
techniques  based  on  information  maps  are  developed  that  minimize  localization  error.  We  compare 
the  performance  of  these  planners  with  other  path  planners,  and  demonstrate  that  this  technique  is 
effective  for  generating  paths  that  increase  the  accuracy  of  localization  while  typically  requiring 
fewer  detections. 


Professor  Shankar  S.  Sastry 
Dissertation  Committee  Chair 


To  my  parents, 

Harry  and  Wilma  Schaffert, 
for  their 
endless  love, 
support, 
and  inspiration. 
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Chapter  1 

Introduction 


Each  year  the  size  and  cost  of  electronics  reduce  producing  a  world  ubiquitous  with  sens¬ 
ing  and  computing  nodes.  Observing  this  trend,  researchers  have  turned  to  developing  hardware 
and  software  platforms  [46,  88]  to  support  distributed  sensing  and  computation  via  hundreds  or 
thousands  of  tiny  nodes.  Each  node,  or  mote,  is  designed  to  be  a  small,  power-efficient  sensing  plat¬ 
form  that  supports  on-board  computation  and  wireless  communication.  Using  a  distributed  array  of 
motes,  or  a  wireless  sensor  network  (WSN),  provides  a  new  sensing  platform  useful  for  a  variety  of 
applications. 

In  the  last  several  years,  many  of  the  foreseen  applications  and  challenges  [35,  75]  of 
WSNs  have  been  actively  researched.  WSNs  have  already  shown  their  usefulness  in  many  dis¬ 
tributed  sensing  arenas  such  as  habitat  monitoring  [20,  61],  meteorology  [59],  information  for  first- 
responders  [38],  and  vehicle  detection  [30].  During  the  development  of  such  applications,  chal¬ 
lenges  on  several  fronts  have  been  investigated  by  researchers:  ad-hoc  networking  [89,  2],  power 
efficiency  [67],  in-network  data  aggregation  [60,  62],  time  synchronization  [34],  and  automatic  node 
localization  [93]  to  list  a  few. 

The  need  to  go  beyond  pure  sensing  applications  and  understand  WSNs  from  a  control  and 
actuation  perspective  has  been  identified  by  researchers  [18].  Several  important  control  applications 
such  as  heating  ventilation  and  air  conditioning  (HVAC)  systems,  pursuit-evasion  games,  and  robot 
navigation  stand  to  benefit  by  the  introduction  of  a  sensor  network  platform.  A  WSN  system  can 
reduce  installation  and  maintenance  costs  for  HVAC  systems  by  eliminating  wires  between  sensing 
and  control  points.  Pursuit-evasion  games  can  acquire  a  global  view  of  the  playing  field  eliminating 
the  need  for  a  pursuer  robot  to  continuously  patrol.  Finally,  robot  navigation  benefits  in  several 
ways.  Sensor  networks  can  localize  robots  for  environments  (such  as  inside  buildings  or  outside 
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under  tree  cover)  that  other  technologies  (such  as  GPS)  cannot.  Additionally,  these  networks  allow 
for  the  localization  of  robots  without  sensors  reducing  the  cost  of  the  robot  platform  and  facilitating 
applications  (such  as  automated  warehouses)  that  could  benefit  by  deploying  fleets  of  such  robots. 
We  refer  to  these  integrated  systems  as  sensor  network  and  control  (SNAC)  systems,  to  distinguish 
them  from  classical  control  systems,  networked  control  systems,  and  systems  for  control  of  the 
network  layer. 

For  a  SNAC  system  to  benefit  from  a  sensor  network  platform,  the  platform  needs  to  be 
easily  deployable,  self-organizing,  robust,  and  long  lasting.  Many  services  are  required  for  building 
a  robust  SNAC  application  such  as  self  localization,  ad-hoc  routing,  detection  and  aggregation, 
and  control.  As  previously  mentioned,  most  of  these  services  have  been  studied  in  recent  years. 
Fiowever,  limited  work  has  addressed  the  design  of  control  systems  capable  of  accurately  utilizing 
the  incoming  information  from  sensor  networks.  Such  composite  systems  are  nontrivial  to  design 
due  to  the  system  interface  dichotomy:  control  systems  [31,  79]  typically  assume  periodic,  high 
frequency  sensor  updates  whereas  sensor  networks  provide  aperiodic,  low  frequency,  and  laggy 
sensor  updates. 

The  goal  of  our  work  is  to  develop  a  general  methodology  that  facilitates  high  quality 
control  services  for  SNAC  applications.  Additionally,  our  work  focuses  on  the  design  of  a  con¬ 
trol  scheme  for  robot  navigation  in  WSNs.  Our  research  approaches  these  topic  step  by  step.  We 
implement  a  complex  SNAC  system  and  provide  practical  insight  into  the  challenges  it  presents. 
We  develop  a  general  system  architecture  for  approaching  SNAC  system  design.  We  develop  sev¬ 
eral  WSN  models  and  a  simulator  suitable  for  testing  our  architecture.  Finally,  focusing  on  robot 
navigation,  we  adapt  localization  and  path  planning  techniques  for  WSNs. 

1.1  Related  Work 

Previous  work  illuminating  our  research  path  is  both  varied  and  broad.  In  addition  to 
addressing  control  systems  and  sensor  networks,  our  work  uses  pursuit-evasion  games  and  robot 
navigation  as  benchmarks  and  illustrative  examples.  Hence,  a  comprehensive  survey  involves  delv¬ 
ing  into  several  research  communities:  classical  control,  sensor  networks,  distributed  systems,  esti¬ 
mation,  mobile  robots,  and  pursuit-evasion  games.  Additionally,  fundamental  to  any  mobile  robot 
application  is  the  ability  for  a  robot  to  localize  itself,  requiring  our  survey  to  include  results  from 
the  localization  and  tracking  communities.  In  fact,  developing  a  robust  sensor  network  system  for 
robot  localization  is  useful  outside  our  area  of  research  in  a  variety  of  areas  such  as  the  hybrid  field 
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of  mobile  sensor  networks.  Applications  such  as  event  sensitive  monitoring  [17,  70]  require  mobile 
sensors  to  gather  around  interesting  events  increasing  the  sensing  accuracy. 

This  survey  explores,  in  order,  estimation  techniques  and  control  (and  navigation)  sys¬ 
tems.  Relevant  pursuit-evasion  game  literature  is  left  for  the  appropriate  chapter.  The  estimation 
survey  covers  tracking  in  sensor  networks,  localization  within  sensor  networks,  and  classical  mo¬ 
bile  robot  localization.  The  control  survey  covers  mobile  robot  control  in  sensor  networks,  classical 
mobile  robot  navigation,  and  extensions  to  classical  control  systems. 

Localizing  a  mobile  robot  can  be  viewed  as  a  sensor  network  tracking  problem.  Such 
problems  have  been  studied  in  various  forms.  Early  work  [76]  uses  a  combination  of  classifiers  and 
Kalman  filters.  Tracking  multiple  objects  is  accomplished  by  first  using  a  series  of  filters  to  associate 
new  detections  with  entities  being  tracked.  Then,  the  track  of  each  entity  is  updated  using  a  Kalman 
filter  and  the  relevant  detections.  Other  researchers  [71,  73]  working  on  this  same  application  have 
employed  Monte  Carlo  techniques.  Oh  et  al  [71]  show  the  utility  of  Markov  chain  Monte  Carlo 
methods  for  tracking  multiple  entities  given  noisy  sensor  network  detections.  The  state  of  a  Markov 
chain  models  the  current  estimate  of  tracks  in  the  network.  The  transitions  of  the  Markov  chain 
are  adapted  to  model  simple  transformations  of  the  estimate  such  as  track  splitting  or  merging.  The 
technique  is  shown  to  successfully  track  multiple  objects  given  noisy  sensor  network  detections. 

Other  sensor  network  tracking  work  [62]  focuses  on  developing  a  generalized  architecture 
for  distributed  estimation.  In  this  work,  the  authors  consider  an  architecture  allowing  nodes  to 
estimate  the  position  of  entities  using  local  sensor  measurements  and  shared  estimation  probabilities 
from  neighboring  nodes.  Techniques  are  developed  for  sharing  estimation  information  amongst 
nodes  to  achieve  favorable  results.  An  architecture  and  a  set  of  filters  are  generated  that  achieve  a 
distributed  Bayesian  estimator.  Additional  work  [13]  by  the  authors  provides  a  brief  overview  of 
the  practical  challenges  of  integrating  these  techniques  for  a  real  deployment. 

Probabilistically  tracking  multiple  objects  in  a  efficient  manner  has  also  been  studied  [57, 
58].  In  this  work,  multiple  objects  are  tracked  using  a  particle  filter.  In  general,  a  particle  filter 
requires  the  use  of  a  joint  distribution  encompassing  probabilities  for  all  the  objects  being  tracked. 
As  the  number  of  objects  being  tracked  increases,  such  a  distribution  quickly  becomes  intractable. 
An  alternative  method  for  managing  the  distribution  is  provided.  The  authors  develop  a  method  of 
joining  and  splitting  distributions  to  allow  objects  close  together  to  be  tracked  with  a  joint  distribu¬ 
tion  and  objects  separated  by  large  distances  to  be  individually  tracked  by  marginal  distributions. 
This  technique  allows  for  the  particle  filter  to  operate  on  several  simple  distributions  rather  than  one 
complex  distribution. 


Still  other  researchers  have  focused  on  tracking  techniques  that  minimally  impact  the  re¬ 
sources  of  the  WSN.  For  instance,  Zhao  et  al  [97]  propose  that  the  execution  of  general  information 
processing  tasks  (such  as  entity  tracking)  is  realized  as  the  solution  to  an  optimization  problem. 
The  proposed  objective  function  incorporates  quantities  such  as  information  gain  and  communica¬ 
tion  costs.  This  technique,  specialized  for  entity  tracking,  asserts  that  the  estimated  tracks  are  only 
updated  as  necessary,  minimizing  power  and  bandwidth  utilization. 

Entity  tracking  within  sensor  networks  is  indeed  an  active  research  area  encompassing 
many  scenarios  and  solutions.  As  already  discussed,  the  focus  of  this  work  is  on  minimizing  net¬ 
work  resources,  tracking  multiple  objects,  or  distributed  information  sharing.  The  focus  of  our 
work,  on  the  contrary,  is  on  the  localization  of  a  cooperative  mobile  robot.  Hence,  the  current  lit¬ 
erature  lacks  an  appropriate  solution  for  our  scenario:  multiple  object  tracking  algorithms  are  too 
general,  distributed  information  sharing  architectures  are  not  necessary,  and  low  resource  utilization 
is  not  our  focus.  Furthermore,  the  cooperative  mobile  robot  localization  problem  is  decidedly  dif¬ 
ferent  than  the  researched  tracking  scenarios:  detections  of  our  cooperative  agent  only  occur  locally 
and  can  be  immediately  sent  (via  broadcast)  to  the  agent  which  assumes  the  role  of  a  centralized 
estimator. 

Before  leaving  the  sensor  network  estimation  literature,  we  expand  our  scope  from  track¬ 
ing  to  localization  and  general  estimation  techniques.  In  this  realm,  researchers  have  studied  many 
specialized  localization  applications  such  as  sniper  localization  [83]  and  sensor  node  localiza¬ 
tion  [93].  However,  these  applications  are  tailored  for  different  sensor  modalities  and  estimation 
of  static  quantities  which  are  not  applicable  to  the  mobile  robot  localization  problem.  The  mobile 
robot  estimation  problem  within  sensor  networks  has  had  limited  coverage.  In  particular-,  Sinop- 
oli  et  al  [84]  consider  a  linear  control  system  with  Kalman  filter  estimation  supplied  with  sensor 
network  detections.  The  sensor  network  is  modeled  as  a  monolithic,  distributed  sensor  that  either 
delivers  a  measurement  on  time  or  drops  it.  The  authors  show  a  relationship  between  the  probability 
of  dropping  a  sensor  reading,  the  system  dynamics,  and  the  system  stability.  This  work  provides 
insight  into  the  amount  of  network  losses  tolerated  by  a  stable  feedback  control  system.  However, 
the  sensor  network  model  is  rather  limited.  It  does  not  model  network  latency  characteristics,  and, 
hence,  has  no  method  to  incorporate  late  arriving  measurements  into  the  Kalman  filter.  In  contrast, 
our  work  does  not  provide  theoretical  stability  bounds,  but  instead  provides  efficient  solutions  for 
more  realistic  and  practical  sensor  network  scenarios. 

The  current  sensor  network  estimation  literature  does  not  directly  address  our  application, 
but  does  provide  an  implicit  suggestion  for  basic  localization  techniques.  At  the  root  of  these  track- 
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ing  and  localization  algorithms,  are  Bayesian  estimation  techniques  (or  approximations  thereof). 
To  provide  more  specific  insight  into  applying  Bayesian  estimation,  we  turn  to  the  mobile  robot 
localization  literature. 

A  large  body  of  research  on  mobile  robot  localization  has  been  carried  out.  Classical 
techniques  such  as  Kalman  filtering  (KF)  [50]  and  Markov  localization  (ML)  [36]  have  been  heavily 
investigated.  Combinations  and  variants  of  these  techniques  have  also  been  explored:  extended 
Kalman  filtering  (EKF)  [54],  Markov  localization-extended  Kalman  filtering  (ML-EKF)  [41],  and 
multi-hypothesis  tracking  (MHT)  [48,  5].  Additionally,  a  wide  variety  of  sequential  Monte  Carlo 
methods,  or  particle  filters  [32],  have  been  explored:  sensor-resetting  localization  [53],  mixture 
Monte  Carlo  localization  [87],  and  adaptive  Monte  Carlo  localization  [27].  Comparisons  of  these 
localization  techniques  [42,  43]  have  been  performed.  In  fact,  any  of  these  techniques  are  candidates 
for  interpreting  our  sensor  network  data.  However,  the  focus  of  our  work  is  to  develop  techniques 
that  provide  an  estimator  with  the  best  information.  In  the  case  of  mobile  robot  navigation,  we  show 
that  Markov  localization  is  a  particularly  good  choice.  Since  this  body  of  literature  tends  to  assume 
a  traditional  robot  model  with  on-board,  high  speed  sensors  (such  as  GPS  and  laser  range  finders), 
these  techniques  are  not  adapted  to  the  data  characteristics  of  sensor  networks.  Fortunately,  the 
mobile  robot  literature  proves  more  fruitful  as  we  move  from  estimation  to  control  (navigation). 

Mobile  robot  navigation  and  control  is  studied  from  a  variety  of  perspectives.  As  we 
explore  the  navigation  literature,  we  begin  with  topics  directly  related  to  sensor  networks.  Then, 
we  expand  our  search  to  include  generalized  path  planning  topics  and  adapted  classical  control 
techniques. 

Robot  navigation  has  been  investigated  in  the  context  of  an  autonomous  robot  deploying 
a  sensor  network  and  using  it  for  coverage,  exploration,  and  navigation  [8,  9,  7],  In  this  work,  each 
node  periodically  emits  a  message  to  nearby  receivers.  As  a  robot  moves  about,  it  selects  the  node  it 
has  recently  received  the  most  messages  from  as  its  current  node.  If  no  messages  have  been  received 
recently,  the  robot  deploys  a  new  node  and  selects  it  as  the  current  node.  The  current  node  is  used 
as  the  robot’s  location.  During  the  sensor  network  deployment  phase,  the  robot  and  network  build  a 
graph  that  probabilistically  maps  from  the  set  of  current  robot  locations  and  control  actions  to  the  set 
of  possible  next  robot  locations.  During  coverage  and  exploration,  nodes  track  which  neighboring 
areas  have  least  recently  been  explored  and  direct  nearby  robots  to  these  areas.  During  navigation, 
the  sensor  network  assigns  a  utility  value  to  each  node  representing  the  likelihood  of  a  robot  nearby 
this  node  reaching  the  desired  destination.  The  utility  values  across  the  network  are  updated  using 
value  iteration.  The  robot  navigates,  node  by  node,  to  the  destination  by  iteratively  navigating  to 
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nearby  nodes  with  the  highest  utility  value.  Although  this  system  can  explore  and  navigate  areas, 
the  graph  it  builds  is  not  based  on  geometric  measures,  but  rather  received  radio  signal  strength. 
This  hinders  the  technique’s  ability  to  be  combined  with  other  sensing  platforms  such  as  laser  range 
finders  or  GPS.  This  also  makes  it  difficult  for  a  robot  to  adapt  to  its  environment  by  learning  its 
dynamic  parameters  such  as  wheel  slippage.  A  final  drawback  to  this  approach  is  that  value  iteration 
is  slow  to  converge  making  it  impractical  for  navigation  with  frequently  changing  way -points  (such 
as  pursuit-evasion  games). 

Robot  navigation  in  sensor  networks  has  been  considered  by  other  researchers  [74,  56]. 
In  this  work,  motes  are  placed  at  known  locations  and  are  capable  of  detecting  danger.  A  system 
for  navigating  the  network  and  avoiding  danger  is  developed.  Each  mote  and  its  neighboring  region 
is  assigned  a  danger  level  based  on  how  many  radio  hop  counts  it  is  away  from  danger,  implicitly 
building  up  a  connectivity  based  graph  for  robot  guidance.  This  information  is  used  by  an  artificial 
potential  field  algorithm  to  guide  the  robot.  This  work  assumes  that  the  robot  can  localize  itself 
to  the  extent  of  determining  which  nodes  are  nearby.  Furthermore,  it  is  assumed  that  the  robot  is 
capable  of  navigation  to  any  desired  (nearby)  node. 

Other  researchers  [22,  24,  23]  investigated  the  ability  of  a  unmanned  aerial  vehicle  (UAV) 
to  deploy,  repair,  localize,  and  navigate  a  sensor  network.  The  UAV  is  capable  of  autonomous  way- 
point  navigation  using  an  on-board  computation  and  sensing  system  (including  a  GPS  sensor).  The 
goal  for  the  sensor  network  deployment  and  repair  phase  is  to  create  and  maintain  a  desired  network 
topology.  The  network  is  configured  to  pass  messages  around  in  order  to  detect  connectivity  holes. 
Once  holes  are  identified,  the  UAV  is  dispatched  to  the  region  to  deploy  new  motes.  During  network 
localization,  the  UAV  flies  a  pattern  above  the  sensor  network  broadcasting  its  GPS  coordinates. 
Motes  receiving  these  messages  infer  their  locations  in  GPS  coordinates  using  received  radio  signal 
strength.  For  navigation,  an  a  priori  generated  path  in  GPS  coordinates  is  flooded  across  the  network 
and  stored  by  motes  near  or  on  the  path.  The  UAV  follows  this  path  by  receiving  partial  path 
information  messages  from  nearby  motes.  This  navigation  technique  amounts  to  an  elaborate  path 
dissemination  algorithm. 

Das  et  al  [28]  consider  a  search  and  rescue  system  using  mobile  robots  and  sensor  nodes. 
Nodes  are  deployed  to  loosely  known  locations  in  a  map.  The  authors  propose  that  both  the  nodes 
and  the  robot  can  be  localized  using  traditional  simultaneous  localization  and  mapping  (SEAM)  [55] 
techniques.  In  particular,  it  is  proposed  that  a  Kalman  filter  is  provided  received  signal  strength  mea¬ 
surements  of  messages  passed  amongst  robots  and  motes.  Navigation  is  achieved  by  first  planning  a 
path  along  a  string  of  nodes  from  the  current  position  to  the  goal  position.  The  network  chooses  the 


7 


string  of  nodes  by  passing  messages  around  and  achieving  a  dynamic  programming  type  solution. 
The  robot  then  follows  the  planned  path  using  a  gradient  decent  routine. 

Our  survey  could  additionally  broaden  its  scope  to  more  generalized  sensor  networks 
such  as  camera  networks.  However,  the  research  in  these  areas  is  not  applicable  to  our  SNAC  sys¬ 
tem  due  to  differences  in  the  platform’s  characteristics  (such  as  fast,  reliable  networks  with  slow 
robot  dynamics  [47])  or  differences  in  the  estimation  architecture  (such  as  switching  amongst  sen¬ 
sors  [52]  rather  than  multiple  sensor  measurement  fusion).  The  literature  on  mobile  robot  control 
and  navigation  with  sensor  networks  is  quite  sparse.  Techniques  that  are  relevant  for  our  platform 
use  non-geometric  based  graphs  for  navigation.  However,  our  work  seeks  to  develop  a  geometric 
localization  and  navigation  technique  allowing  integration  with  traditional  sensors  (such  as  GPS, 
laser  range  finders),  admitting  robot  learning  (such  as  learning  the  robot’s  wheel  slippage),  and  en¬ 
abling  fine  grained  control.  Furthermore,  previous  navigation  techniques  based  on  received  radio 
signal  strength  are  suited  to  only  specialized  environments  (such  as  simple  indoor  environments 
with  controlled  radio  connectivity). 

The  mobile  robot  community  has  extensively  investigated  navigation  and  control.  How¬ 
ever,  the  focus  is  typically  on  topics  non-applicable  to  our  work  such  as  obstacle  avoidance,  map 
building,  or  coordinated  movement.  Fortunately,  previous  work  on  intelligent  path  planning  is  help¬ 
ful.  Makarenko  et  al  [63]  consider  the  problem  of  integrated  exploration  entailing  path  planning, 
navigation,  and  localization.  The  authors  assume  a  traditional  robot  platform  utilizing  SLAM  [55] 
techniques  with  an  extended  Kalman  filter  for  mapping  and  robot  localization  combined  with  an 
occupancy  gird  for  obstacle  avoidance.  In  particular,  the  robot  movement  algorithm  proceeds  by 
suggesting  several  candidate  destinations,  choosing  a  destination,  and  navigating  to  the  chosen  des¬ 
tination.  The  candidate  destinations  are  suggested  for  improved  exploration.  A  single  destination  is 
found  as  the  solution  to  an  optimization  problem  over  localization  accuracy,  information  gain,  and 
travel  time.  Finally,  the  robot  navigates  to  the  destination  by  planning  a  path  that  avoids  obstacles. 

Our  work  focuses  on  navigating  not  for  obstacle  avoidance,  but  for  improved  localiza¬ 
tion.  Hence,  both  our  work  and  the  Makarenko  et  al  work  exploits  a  localizability  metric  to  gauge 
a  position’s  ability  to  accurately  localize  the  robot.  However,  our  metric  is  adapted  for  a  distributed 
sensing  environment  and  is  optimized  at  every  point  along  the  path,  rather  than  at  just  a  few  desti¬ 
nations.  Additionally,  since  our  focus  is  on  sensor  networks,  it  will  become  clear  that  the  Gaussian 
noise  model  implicit  with  Kalman  filtering  is  inappropriate.  Instead,  our  work  develops  a  special¬ 
ized  sensor  model  for  use  with  Markov  localization.  Finally,  the  development  of  our  localizability 
metric  is  different  from  the  one  used  by  Makarenko  et  al.  In  particular,  the  metric  our  work  uses  is 


provided  in  the  work  by  Roy  et  al  [78],  but  is  adapted  to  our  distributed  sensor  network  model.  There 
are  a  few  distinctions  between  the  Makarenko  et  al  and  the  Roy  et  al  localizability  metric.  In  par¬ 
ticular,  the  former  assumes  infinite  initial  covariance  and  infinite  readings  at  each  position,  whereas 
the  later  assumes  a  well-defined  prior  with  finite  covariance  and  assumes  an  expected  number  of 
readings  at  each  position.  Our  work  will  clarify  why  an  infinite  initial  covariance  is  inappropriate 
for  the  multiple,  local  sensor  platform  of  a  WSN. 

Earlier  work  [86]  utilizing  a  localizability  metric  for  path  planning  has  also  been  carried 
out.  This  work  assumes  a  robot  platform  with  a  ranging  sensor  and  Kalman  filtering  (similar  to 
the  work  by  Makarenko  et  al),  and  a  localizability  metric  consistent  with  the  work  by  Roy  et  al. 
Unfortunately,  this  work  concluded  with  hand  generated  path  producing  less  drift  during  navigation 
than  a  path  based  on  the  localizability  metric. 

Moving  away  from  the  mobile  robot  navigation  and  control  literature  and  entering  the 
classical  control  arena,  our  survey  finds  more  practical  tips.  Some  researchers  [64,  65]  developed 
real-time  compensation  techniques  for  networked  control  systems.  Their  work  describes  a  technique 
that  compensates  for  sensing  and  actuation  jitter  for  linear,  discrete-time  systems.  The  authors 
assume  the  jitter  is  known  at  run-time  and  develop  an  algorithm  that  simulates  the  system  forward 
in  time  and  recomputes  the  control  action.  Although  this  technique  is  only  described  for  linear, 
discrete  time  systems,  our  research  applies  this  idea  to  an  extended  Kalman  filter  to  correct  for 
network  latency. 

Predictive  control  techniques  are  also  useful  for  incorporating  several  goals  and  dealing 
with  missing  or  late  sensor  measurements  typical  of  sensor  networks.  For  instance,  Shim  et  al  [82] 
develop  a  model  predictive  control  (MPC)  framework  for  navigation  of  aerial  vehicles.  The  authors 
use  a  MPC  to  incorporate  several  goals  such  as  obstacle  avoidance  and  way-point  navigation.  Using 
a  finite  time  horizon,  the  MPC  is  able  to  plan  locally  optimal  navigation  routes.  Additionally,  such 
a  framework  is  able  to  continue  operating  using  only  predicted  results  during  periods  of  missing 
measurements.  Our  general  control  architecture  will  avail  itself  of  this  technique. 

Before  leaving  behind  the  literature  on  control  and  sensor  networks,  we  mention  one  final 
result  for  the  interested  reader.  In  a  paper  by  Ye  et  al  [96],  the  effects  of  communication  protocols 
on  control  systems  are  investigated  using  realistic  network  simulations.  The  authors  consider  a 
mobile  robot  cooperation  application  where  communication  between  robots  is  simulated  using  the 
Network  Simulator  [12].  It  is  demonstrated  that  a  communication  scheme  with  less  frequent,  but 
longer  packets  is  superior  to  one  with  more  frequent,  but  smaller  packets  for  their  application.  This 
counter-intuitive  result  reminds  us  that  more  research  is  needed  on  networked  control  systems. 
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After  a  brief  overview  of  the  literature  relevant  to  sensor  network  and  control  systems  - 
and,  in  particular,  to  robot  navigation  -  we  find  that  many  topics  have  not  be  addressed.  In  particular, 
since  sensor  networks  is  a  new  research  area,  few  researchers  have  considered  their  use  for  control 
or  robot  navigation.  Additionally,  since  only  a  few  medium  to  large  scale  sensor  networks  have  been 
deployed,  our  knowledge  of  practical  challenges  is  limited.  Finally,  to  the  best  of  our  knowledge, 
only  one  medium  to  large  scale  SNAC  system  has  been  deployed.  This  system  was  our  pursuit- 
evasion  game  system  and  is  discussed  in  the  next  chapter. 

1.2  Dissertation  Contributions 

Our  work  differs  from  previous  work  in  several  ways.  The  goal  of  our  work  is  to  de¬ 
velop  a  practical  design  methodology  for  realistic  SNAC  systems.  To  this  end,  our  work  allows 
for  complex  models  of  the  WSN  and  the  control  system,  and  our  work  focuses  on  demonstrating 
good  performance  rather  than  finding  theoretical  bounds  on  it.  We  are  motivated  by  the  fact  that 
the  theoretical  work  tends  to  be  too  specialized  to  be  useful  for  real  deployments,  and  the  practical 
implementations  have  yet  to  adequately  address  fundamental  control  challenges. 

Our  work  achieves  these  goals  by  implementing,  modeling,  and  evaluating  2  representa¬ 
tive  SNAC  applications:  pursuit-evasion  games  and  way-point  navigation.  Our  work  includes  one 
of  the  first,  and  the  largest  SNAC  implementations.  This  implementation  provides  insight  into  the 
difficult  challenges  facing  control  systems  in  sensor  networks  and  aids  in  developing  a  generalized 
SNAC  system  architecture.  This  work  also  leads  us  in  developing  several  realistic  application-level 
WSN  models  and  a  corresponding  simulator.  Using  this  simulator,  we  are  able  to  demonstrate  the 
benefit  of  our  SNAC  system  architecture.  The  analysis  of  this  system  leads  us  to  consider  intelli¬ 
gent  path  planning  for  robot  navigation.  Focusing  on  navigation,  we  adapt  a  localizability  metric 
for  WSNs,  we  create  a  novel  path  planning  routine,  and  we  demonstrate  the  benefit  of  this  ap¬ 
proach.  Our  final  framework  allows  for  simple,  yet  powerful  extensions  to  path  planning  and  robot 
localization  such  as  path  planning  for  explicit  bandwidth  reduction  or  development  of  an  optimal 
re-deployment  strategy.  In  summary,  these  contributions  are 

•  first,  largest  SNAC  implementation 

•  identify  fundamental  SNAC  challenges 

•  develop  a  general  SNAC  system  architecture 


10 


•  develop  practical,  application-level  models  for  WSNs 

•  develop  an  application-level  simulator  for  SNAC  systems 

•  demonstrate  the  benefit  of  our  SNAC  system  architecture 

•  tailor  a  localizability  metric  for  WSNs 

•  develop  and  demonstrate  the  benefit  of  a  novel  path  planning  technique  for  robots  in  WSNs 

•  establish  a  localization  and  path  planning  framework  enabling  simple,  powerful  extensions 

1.3  Thesis  Outline 

Our  work  is  presented  in  the  following  manner.  Chapter  2  presents  our  implementation 
of  a  pursuit-evasion  game  played  within  a  wireless  sensor  network.  First,  the  game  is  introduced 
with  relevant  background  and  related  work.  Then,  the  hardware  and  software  platform  is  described. 
Next,  the  algorithms  for  sensing,  communication,  and  control  are  described.  Results  collected  from 
the  deployment  are  discussed.  Finally,  the  practical  lessons  learned  and  the  challenging  aspects  of 
this  application  are  presented. 

Using  the  results  from  our  PEG  deployment,  Chapter  3  develops  a  practical  system  archi¬ 
tecture  for  general  SNAC  systems.  Three  main  challenge  categories  along  with  potential  solutions 
are  discussed.  This  chapter  concludes  with  a  framework  that  unifies  these  solutions.  Chapter  4  de¬ 
velops  and  uses  a  SNAC  system  simulator.  First,  a  set  of  system  models  are  developed  for  the  WSN 
and  robot  platform.  Then,  several  components  of  the  aforementioned  architecture  are  instantiated. 
Finally,  a  set  of  simulations  are  performed  to  test  our  controller  design  and  evaluate  the  effect  of 
path  planning  on  localization  accuracy. 

The  path  planning  simulations  leads  us  to  adapt  a  localizability  metric  for  WSNs  in  Chap¬ 
ter  5.  This  chapter  begins  with  an  overview  of  robot  localization  and  a  localizability  metric.  Next, 
the  details  of  computing  this  metric  are  discussed  and  a  set  of  simulations  is  performed.  The  next 
chapter,  Chapter  6,  moves  to  adapt  this  metric  for  more  realistic  WSN  models  and  to  begin  testing 
its  usefulness  for  localization  and  path  planning.  Several  simulations  are  provided  to  demonstrate 
the  ability  of  this  metric  to  gauge  localization  performance. 

Building  on  this  result.  Chapter  7  develops  techniques  to  generate  automatic  paths  based 
on  the  localizability  metric.  An  interpretation  of  entropy  for  path  planning  is  given  and  several 
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closed-loop  simulations  are  provided  as  evidence  this  technique  reduces  localization  error.  Finally, 
Chapter  8  reviews  the  results  of  this  work  and  provides  some  concluding  remarks. 
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Chapter  2 

Pursuit-Evasion  Game 


To  date,  we  have  implemented  several  sensor  network  and  control  (SNAC)  systems,  rang¬ 
ing  from  light  monitoring  and  tracking,  to  RC  car  tracking  with  a  pan-tilt  camera,  to  pursuit-evasion 
games  on  outdoor  sensor  networks  [81,  21],  This  chapter  provides  an  overview  of  the  July  2003 
pursuit-evasion  game  implementation,  providing  enough  detail  to  serve  as  a  point  of  reference  for 
later  discussions  on  challenges  and  solutions  for  SNAC  systems.  Our  pursuit-evasion  game,  or 
PEG,  is  a  distributed  network  of  wireless  sensors  that  aids  a  cooperative  agent  (pursuer)  in  tracking 
an  uncooperative  agent  (evader).  These  games,  in  various  forms,  have  been  extensively  studied. 
Researchers  [45,  44]  have  investigated  the  game  theoretic  optimal  strategies  developing  Nash  so¬ 
lutions.  For  continuous  (polygonal)  regions,  other  researchers  [95]  have  determined  bounds  on  the 
number  of  pursuers  needed  for  capture.  For  games  occurring  on  a  graph,  researchers  [72]  have  stud¬ 
ied  how  to  search  for  evaders,  and  others  [1]  have  studied  the  number  of  pursuer  needed  to  catch  an 
evader.  For  a  distributed  sensing  platform,  Demirbas  et  al  [29]  have  studied  pursuit-evasion  games 
on  graphs  whose  vertices  are  covered  by  sensor  nodes.  The  authors  assume  that  communication  is 
symmetric  and  induces  a  fully  connected  graph.  Additionally,  they  assume  that  sensor  nodes  can 
exactly  detect  the  pursuer  and  evader  when  the  robots  are  at  the  node.  Finally,  they  assume  that  a 
pursuer  robot  can  a  priori  navigate  between  nodes.  The  authors  develop  a  tunable  pursuit  algorithm 
that  allows  for  a  trade-off  between  capture  time  and  energy  efficiency.  The  algorithm  functions  by 
having  the  network  maintain  a  tree  rooted  at  the  evader.  The  tree  is  updated  with  recent  detections 
of  the  evader.  Our  work,  considers  the  practical  challenges  for  implementations  of  pursuit-evasion 
games  on  sensor  networks.  Other  work  [91,  51]  implementing  pursuit-evasion  games  has  focused 
on  different  platforms,  considering  vision  based  robots  or  multiple  pursuer  policy  choices.  PEG 
provides  a  real  world,  medium  scale  (larger  pure  sensing  systems  [21,3]  have  been  deployed  since) 
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SNAC  system  deployment  from  which  we  learn  simple,  effective,  pragmatic  design  and  modeling 
methodologies.  PEG  provides  an  extensive  SNAC  benchmark  requiring  solutions  to  many  chal¬ 
lenges: 

•  detection  and  disambiguation  of  mobile  agents 

•  distributed  coordination  and  sharing  of  mote  resources 

•  in-network  processing  and  aggregation 

•  routing  to  mobile  agents 

•  closed-loop  control 

The  services  developed  to  confront  these  challenges  are  developed  with  the  whole  system  in  mind 
to  reduce  overall  latency  and  provide  the  pursuer  the  ability  to  react  in  real-time. 

Being  one  of  the  first  medium  scale,  distributed  tracking  implementations  using  resource 
constrained  wireless  sensors,  PEG  provides  us  with  a  road  map  of  the  challenges  ahead,  new  ideas 
from  the  lessons  learned,  and  a  grocery  list  of  tools  needed.  The  hardware  and  software  platform 
along  with  the  basic  information  flow  is  introduced  in  Section  2.1.  Section  2.2  provides  a  detailed 
overview  of  the  algorithms  used.  The  results  are  summarized  in  Section  2.3.  Finally,  Section  2.4 
discusses  the  lessons  learned  from  PEG.  In  later  chapters,  the  experience  with  PEG  informs  an 
exploration  of  the  challenges  control  systems  face  along  with  providing  a  starting  point  for  devel¬ 
oping  SNAC  models  and  simulators.  More  details  of  the  entire  PEG  deployment  is  presented  in  our 
published  work  [81]. 

2.1  Platform 

This  section  describes  the  hardware  and  basic  software  architecture  used  for  PEG.  PEG  is 
played  on  a  20  meter  by  20  meter  outdoor  field  with  121  motes  and  2  ground  robots.  The  motes  are 
uniformly  deployed  in  an  1 1  by  1 1  grid  with  2  meter  spacing.  The  robots  are  sequentially  released 
onto  the  field:  first  the  evader  enters,  then  a  few  seconds  later,  the  pursuer  enters.  A  base  station 
outside  the  field  monitors  and  displays  system  statistics  online.  In  the  following,  we  describe  the 
mote,  robot,  and  base  station  hardware  along  with  the  basic  flow  of  information  amongst  them. 

The  motes  used  for  PEG  are  the  Berkeley  Mica2Dot’s  [46,  88]  shown  in  Figure  2.  la.  They 
are  equipped  with  a  wireless  radio,  ultrasonic  transceiver,  magnetometer,  and  outdoor  enclosure. 
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(a)  Mica2Dot  mote.  (b)  Pioneer  robot. 


Figure  2.1:  Hardware  used  during  the  July  2003  pursuit-evasion  game  deployment. 

The  computational  resources  are  provided  by  the  8-bit  4  MHz  Atmel  ATMEGA128L  CPU  with  128 
kB  of  instruction  memory  and  4  kB  of  RAM.  The  on-board  radio,  the  Chipcon  CC1000  operating 
at  1  GHz,  provides  about  2  kB/s  of  shared  network  bandwidth  for  applications  after  accounting 
for  network  overhead.  The  communication  range  varies  heavily  depending  on  the  antenna  and  the 
environment;  during  PEG,  with  the  use  of  a  quarter  wave  length  antenna  and  the  mote  a  few  inches 
off  a  level  playing  field  (comprised  of  dirt  and  short  grass),  our  maximum  communication  range  was 
about  30  meters.  The  ultrasonic  transceiver  operates  at  25  kHz  and  uses  a  top  mounted  deflector 
cone  to  provide  about  2  meters  of  ranging  in  the  plane  of  the  playing  field.  The  magnetometer  is 
tuned  to  detect  changes  in  the  magnitude  of  magnetic  field  near  the  mote;  it  detects  our  robots  up 
to  about  3-4  meters  away.  The  hardware  is  mounted  in  an  outdoor  enclosure  with  a  flexible  base 
allowing  for  extended  communication  range  and  resilience  to  robot  impact.  Finally,  the  motes  run 
the  small,  embedded  TinyOS  [46,  88]  operating  system 

The  evader  and  pursuer  robots  are  identical  outdoor,  four-wheeled  robots  (Figure  2.1b) 
equipped  with  mobile  computers,  wireless  radios,  and  GPS  units.  The  on-board  computer  is  a  266 
MHz  Pentium2  CPU  with  128  MB  of  RAM  and  a  20  GB  hard  drive  running  Linux.  Each  robot  uses 
an  802.1 1  wireless  radio  for  communication  to  the  base  station.  The  GPS  unit  provides  the  robot’s 
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(a)  Mote  data  flow. 


(b)  Robot  data  flow. 


Figure  2.2:  Flow  of  information  (magnetic  detections  and  position)  during  the  pursuit-evasion  game 
deployment. 

position  to  within  2  cm  every  0. 1  seconds.  Each  robot  has  a  top  speed  of  about  1  m/s  and  a  motor 
control  subsystem  capable  of  independently  controlling  the  velocity  of  each  of  the  four  wheels.  The 
evader  robot  is  driven  remotely  by  a  person,  and  the  pursuer  robot  is  controlled  autonomously  by 
on-board  control  software.  The  base  station  uses  a  802.11  wireless  link  to  receive  robot  controller 
statistics  and  a  high-gain  mote  antenna  to  snoop  the  mote  network. 

The  final  software  architecture  contains  many  application  services  related  to  entity  de¬ 
tection  and  pursuit  (localization,  sensing,  routing,  state  estimation,  interception  planning,  etc)  and 
even  more  services  supporting  these  (ranging,  tree  building,  leader  election,  etc).  We  provide  a  brief 
overview  of  only  those  services  directly  along  the  entity  detection  and  pursuit  path.  A  flow  chart  of 
these  services,  split  between  the  sensor  network  and  robot  hardware,  is  shown  in  Figure  2.2. 

The  mote  data  flow,  shown  in  Figure  2.2a,  is  designed  to  provide  detection  reports  to  the 
pursuer.  Since  these  reports  relay  the  location  and  strength  of  a  magnetic  detection,  a  localization 
service  and  entity  detection  service  are  fundamental.  The  localization  service  utilizing  the  ultra¬ 
sound  transceiver  (and  sub-services  such  as  ranging)  is  responsible  for  localizing  the  mote  relative 
to  an  anchor  mote  in  the  network.  Due  to  sporadic  errors  of  this  service,  the  motes  used  a  position 
value  provided  by  the  user  during  the  PEG  trials.  The  implementation  and  applicability  of  the  auto¬ 
matic  localization  service  is  further  explored  in  other  work  [93].  The  entity  detection  service  filters 
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measurements  from  the  magnetometer  and  reports  robot  detections.  The  information  collected  by 
the  localization  and  entity  detection  service  is  stored  and  shared  by  the  neighborhood  service.  This 
service  provides  a  distributed,  loosely  synchronized  storage  system  allowing  neighboring  motes  to 
share  information.  Using  the  information  stored  by  this  service,  the  detection  aggregation  service 
compiles  location  and  detection  information  to  be  sent  to  the  pursuer.  Finally,  the  routing  service  is 
responsible  for  dispatching  messages  amongst  motes  and  the  mobile  pursuer. 

The  robot  data  flow,  shown  in  Figure  2.2b,  is  designed  to  interpret  the  detection  reports 
from  the  mote  network  and  guide  the  pursuer  to  the  evader.  The  pursuer  receives  entity  detection 
reports  over  its  mote  radio  and  pursuer  position  information  over  the  GPS  channel.  Using  this 
information,  the  pursuer  estimates  the  state  (position,  orientation,  etc)  of  itself  and  the  evader.  The 
interception  planning  service  determines  the  path  the  pursuer  should  follow  in  order  to  efficiently 
capture  the  evader.  Finally,  the  navigation  service  interfaces  with  the  motors  and  ensures  the  desired 
path  is  followed. 

2.2  Algorithms 

This  section  describes  the  algorithms  making  up  the  aforementioned  services.  Before 
describing  the  flow  of  detection  events  within  the  sensor  network  or  the  pursuer,  we  investigate  the 
mechanism  that  links  these  two  systems:  the  routing  layer.  Then,  starting  with  the  mote  platform,  we 
explain  the  mechanisms  that  sense  entities,  share  detections,  and  aggregate  readings.  Next,  turning 
to  the  robot  platform,  we  discuss  how  detections  are  processed,  how  the  system  state  is  estimated, 
how  an  interception  path  is  planned,  and  how  this  path  is  followed. 

The  mote  routing  layer  is  responsible  for  delivering  messages  from  many  (motes)  to  few 
(mobile  agents).  Using  landmark  routing  [89],  we  partition  the  routing  service  into  2  pieces:  route 
from  many  to  a  landmark,  and  route  from  a  landmark  to  few.  Landmark  routing  requires  that  the 
user  chooses  a  landmark  and,  using  the  landmark  as  a  root  node,  builds  a  spanning  tree.  A  spanning 
tree  is  built  by  flooding  the  network  with  a  beacon  that  tracks  hop  count.  Each  mote  will  choose  its 
parent  as  the  mote  whom  it  heard  broadcast  the  beacon  with  the  lowest  hop  count.  However,  this 
approach  is  vulnerable  to  generating  asymmetric  links  and  back  edges. 

Asymmetric  links  are  generated  because  a  mote  does  not  verify  the  bidirectional  reliabil¬ 
ity  of  the  link  between  itself  and  its  parent.  We  address  this  issue  by  filtering  out  messages  that 
originate  from  an  unreliable  link.  In  particular,  if  the  message  is  received  with  a  received  signal 
strength  indicator  (RSSI)  below  a  preset  threshold,  or  from  a  mote  located  too  far  away  according 
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to  localization  information,  the  packet  is  dropped.  The  RSSI  threshold  is  set  to  a  value  that  provides 
a  highly  reliable  link  based  on  small  scale  experiments  using  the  same  platform. 

Back  edges  are  caused  by  frequent  message  collision  amongst  motes  re-broadcasting  the 
beacon.  These  collisions  cause  a  nearby  mote  to  choose  a  parent  whose  beacon  messages  did  not 
collide,  often  resulting  in  a  higher  hop  count  parent  than  necessary.  This  issue  is  addressed  by 
requiring  that  each  receiving  mote  waits  a  random  amount  of  time  less  than  an  upper  threshold 
Tb  before  broadcasting  the  beacon.  Additionally,  if  another  beacon  is  heard  before  a  mote  has 
broadcasted,  it  must  choose  another  time  tb  <  Tb  and  begin  waiting  again. 

Once  a  spanning  tree  is  built,  any  mote  can  route  to  the  landmark.  Routing  to  the  mobile 
agent  is  achieve  by  building  a  crumb  trail.  The  mobile  agent  periodically  broadcast  a  beacon  to 
a  nearby  mote.  Using  the  spanning  tree  already  in  place,  this  beacon  is  routed  to  the  landmark. 
Along  the  way,  each  connecting  mote  is  left  with  a  crumb,  the  identity  of  the  previous  mote  during 
this  communication.  Any  message  at  the  landmark  can  now  be  routed  back  to  the  mobile  agent  by 
reversing  the  crumb  trail.  Additionally,  crumbs  are  associated  with  a  timeout  and  are  erased  from 
motes  when  they  become  stale.  With  the  routing  service  in  place,  we  turn  to  the  mote’s  internal 
operations. 

The  detection  sequence  at  the  mth  mote  begins  with  the  entity  detection  component.  This 
component  is  fed  measurements  of  the  absolute  magnetic  field  via  the  magnetometer.  Due  to  drift 
in  the  magnetometer,  the  raw  magnetic  field  readings  B™aw  are  high  pass  filtered  to  generate  B"' .  If 
the  processed  value  Bm  is  above  a  preset  threshold  Breport,  Bm  along  with  the  mote’s  position  zm  is 
broadcast  to  neighboring  motes  via  the  neighborhood  service.  After  which,  the  mote  must  wait  for 
Treport  seconds  before  reporting  another  reading.  We  refer  to  these  single  magnetometer  reading 
packets  as  detection  messages. 

The  local  neighborhood  information  sharing  implementation  used  was  Hood  [94] .  Hood 
operates  by  broadcasting  local  updates  to  the  state  disregarding  whom  its  neighbors  are  or  if  they 
can  hear  it.  Motes  who  hear  a  Hood  broadcast  and  consider  the  broadcasting  mote  to  be  a  neigh¬ 
bor,  update  their  local  state.  For  our  purposes,  Hood  was  configured  to  form  a  magnetic  detection 
neighborhood  with  radius  R]lood ■  Additionally,  this  neighborhood  was  set  to  prune  detections  older 
than  Thood  seconds. 

The  mote  constantly  monitors  the  neighborhood  state,  and,  if  it  has  the  largest  magnetic 
detection  of  any  of  its  neighbors,  the  mote  elects  itself  as  a  leader.  As  a  leader  mote,  it  aggregates 
all  the  detection  readings  it  received  in  the  last  Treport  seconds  into  one  packet  and  sends  this  via  the 
routing  service  to  the  pursuer.  We  refer  to  these  aggregated  magnetometer  reading  packets  as  event 
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messages.  Once  a  mote  is  a  leader,  it  must  wait  Tjea(jer  seconds  before  becoming  a  leader  again. 

When  the  pursuer  receives  an  event  message,  it  updates  its  estimate  of  the  system  state, 
and  using  this  estimate,  it  tracks  the  evader.  The  block  diagram  for  this  process  is  shown  in  Fig¬ 
ure  2.3.  Two  coordinate  systems  are  used  by  the  pursuer:  GPS  coordinates  relative  to  a  nearby 
landmark  and  mote  network  coordinates  aligned  with  the  mote  deployment  grid.  Details  of  these 
coordinate  systems  are  not  important  since  a  fixed  homogeneous  coordinate  transformation  between 
the  two  systems  is  assumed  to  be  known.  Unless  explicitly  stated  by  using  a  GPS  subscript,  all  quan¬ 
tities  are  in  terms  of  the  mote  coordinate  system.  The  pursuer  receives  (IgpS,  fl’.pj.  an  estimate  of  its 
field  position  in  GPS  coordinates,  which  is  transformed  into  the  pursuer  position  estimate  (lp,] ,  lp-2') 
in  mote  coordinates. 

Many  techniques  [77,  90]  exist  for  tracking  and  estimating  moving  entities.  Flowever, 
the  choice  of  such  algorithms  is  limited  by  the  robot  dynamics  and  sensors.  In  particular,  given 
the  accuracy  of  the  GPS,  the  pursuer’s  state  is  estimated  using  simple  averaging  techniques.  More 
specifically,  (lPi1,  lp’2)  are  smoothed  with  a  Nf  step  time  window  average  to  produce  the  position 
estimate  (P,p,  l2,p).  Then,  the  Np  most  recent  position  estimates  are  used  (pairwise)  to  form  several 
orientation  estimates,  which  are  averaged  to  form  the  orientation  estimation  6P. 

The  pursuer  estimates  the  evader’s  state  using  received  event  messages.  First,  each  de¬ 
tected  position  (F1,/7  2)  is  classified  as  being  caused  by  noise,  the  pursuer,  or  the  evader.  After 
which,  only  evader  detections  are  kept  and  sent  to  the  evader  state  estimation  service.  Several  tech¬ 
niques  exists  [71,  15,  33,  40]  for  classifying  and  tracking  objects  in  a  sensing  region.  For  estimation 
of  the  evader,  we  note  that  since  our  robots  can  change  their  wheel  speed  within  about  0.2  seconds, 
and  the  network  only  reports  detections  about  every  1-3  seconds,  we  can  use  a  simple  kinematic 
model  accounting  only  for  a  robot’s  maximum  speed  smax: 

lk+ 1  —  F  +  {SmaxT)Uk  (2.1) 

where  smax  >  0,  the  control  tq  e  [-1,  l]2,  and  the  sampling  period  T  >  0.  With  our  model,  we 
associate  measurements  close  to  the  evader  with  the  evader,  and  using  an  average  of  these  mea¬ 
surements,  we  estimate  the  evader’s  position.  In  particular,  assuming  that  the  error  in  any  detected 
position  may  be  as  much  as  dn  (according  to  the  2-norm),  and  that  the  capture  radius  dc  is  larger 
than  this  error,  detections  within  dn  of  the  pursuer’s  estimated  location  are  ignored.  These  messages, 
if  not  detections  of  the  pursuer  or  noise,  are  detections  of  a  captured  evader.  Furthermore,  any  de¬ 
tections  farther  than  2 dn  +  smaxt  from  the  evader’s  last  estimated  position,  are  considered  noise  and 
ignored  where  t  is  the  time  since  the  last  measurement.  All  remaining  messages  are  assumed  to  be 
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Figure  2.3:  The  pursuer’s  estimation  and  pursuit  control  system. 
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Figure  2.4:  Stills  from  a  26  second  pursuit-evasion  sequence.  The  pursuer  and  evader  start  in  the 
upper  left  and  lower  right  corners,  respectively.  The  evader  is  captured  in  the  last  frame. 

the  position  detection  of  the  evader  and  are  labeled  (le-1,  le'2).  These  messages  are  used  to  estimate 
the  position  of  the  evader  (te,1,te’2). 

Using  the  estimated  state  of  the  pursuer  and  evader,  the  interception  planner  generates  a 
way -point  (lnav'1,  lnav’2)  for  the  pursuer  to  go  to  in  order  to  intercept  the  evader.  Our  interception 
planner  simply  generated  the  way  point  at  the  evader’s  estimated  position.  The  interception  con¬ 
troller  also  implements  2  safety  specifications:  keep  the  pursuer  inside  the  playing  field  and  do  not 
collide  with  the  evader.  Keeping  the  pursuer  inside  the  playing  field  is  achieved  by  planning  a  path 
to  the  center  of  the  playing  field  if  the  robot  ever  leaves.  Avoiding  collisions  with  the  evader  is 
achieved  by  halting  the  pursuer  when  it  is  within  the  capture  radius  of  the  evader  and  restarting  the 
pursuer  when  the  evader  moves  farther  away.  Finally,  the  robot  is  directed  to  the  new  way-point  via 
the  point  navigation  controller  managing  the  robot  wheel  velocities  a>1,  or,  m3,  to4. 

2.3  Results 

PEG  was  deployed  outdoors  and  many  of  the  parameter  values  were  set  after  online  test¬ 
ing.  In  particular,  Brepor,  was  set  so  most  motes  would  detect  a  robot  within  3-4  meters.  Mote 
neighborhoods  were  set  to  include  a  9  by  9  grid  of  motes  by  letting  Rhood  -  3  meters.  The  timeouts 
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were  chosen  to  be  0.5  seconds:  Treport  =  T iwolj  =  T/t.a/jer  =  0.5  seconds.  Finally,  the  pursuer’s 
position  and  orientation  were  estimated  by  windows  of  length  N?  =  2  and  Nfj  =  4,  respectively. 

PEG  was  ran  a  half-dozen  times,  and,  in  each  run,  the  pursuer  successfully  captured  the 
evader.  One  particular  run  is  shown  in  Figure  2.4.  In  the  first  frame  (upper  left),  the  pursuer 
(robot  with  the  orange  cone  attached)  is  shown  in  the  upper  left  and  is  oriented  down  and  to  the 
left.  The  evader  is  in  the  lower  right  and  is  oriented  facing  right.  As  the  sequence  proceeds,  the 
evader  (driving  in  reverse)  turns  towards  the  bottom  of  the  frame  and  drives  away  from  the  pursuer. 
Meanwhile,  the  pursuer  turns  to  face  the  evader  and  drives  towards  it.  The  last  frame  shows  the 
result:  the  evader  is  captured.  From  start  to  finish,  this  sequence  spans  about  26  seconds. 

Unfortunately,  PEG  was  insufficiently  instrumented  to  capture  the  necessary  data.  Hence, 
we  instrumented  and  re-deployed  PEG  on  a  7  by  7  grid  with  2  meter  spacing.  The  primary  focus 
of  our  re-deployment  was  to  study  the  detection-routing-reaction  chain  (latency,  loss,  noise,  etc). 
To  study  this,  a  single  robot  is  driven  around  in  the  field  and  the  network  activity  is  monitored.  In 
particular,  a  base  station  snoops  on  the  network  packets  and  the  robot’s  GPS  position.  Figure  2.5 
shows  3  views  of  such  a  run:  an  overhead  view  of  mote  detections,  an  overhead  view  of  estimated 
robot  position,  and  a  time  line  of  detection  and  estimation  error.  Figure  2.5a  provides  a  view  of  the 
robot  path  in  the  mote  network.  The  solid  line  is  the  GPS  measured  path  of  the  robot  which  starts  at 
about  (0.4, 7.6)  at  t  =  0  seconds  and  concludes  around  (-0. 1 , 9.6)  at  t  =  145  seconds.  The  robot  path 
is  additionally  demarcated  by  a  square  for  every  20  seconds  of  run  time.  Motes  who  elect  themselves 
as  leaders  and  report  event  messages  are  shown  as  blue  stars.  A  dashed  blue  line  is  drawn  from  each 
leader  to  the  position  of  the  robot  at  the  time  the  event  message  was  received.  Initial  tests  of  this 
deployment  revealed  that  the  motes  at  (4, 10)  and  (4, 12)  were  frequently  reporting  false  detections; 
these  mote  detections  were  disabled  to  prevent  them  from  saturating  the  network,  and,  hence,  are 
not  shown  in  this  figure. 

Sifting  through  this  figure,  we  notice  several  phenomenon.  Short  dashed  lines  tend  to 
indicate  a  good  detection  with  minimal  latency.  For  example,  the  motes  at  (10,  8)  and  (12,  8)  tend 
to  report  the  robot’s  position  reliably  and  quickly.  Slightly  longer  dashed  lines  indicate  a  higher 
latency  communication  link  between  the  leader  mote  and  the  robot.  Fong  dashed  lines,  such  as  the 
one  connecting  to  the  mote  at  (12, 12),  represent  false  detections.  The  motes  at  (12, 12),  (12,0), 
and  (4, 4)  were  misbehaving  and  frequently  reporting  false  detections.  Poor  detections  made  robot 
position  estimation  difficult  as  shown  in  Figure  2.5b.  This  figure  removes  the  leader  indicators  of 
the  previous  figure  and  adds  the  estimated  robot  position  (solid  orange  line).  The  estimated  robot 
position  was  generated  offline  using  a  Kalman  filter  and  the  sensor  network  reports. 
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The  Kalman  filter  assumes  simple  (linear)  point  mass  kinematics  with  no  movement  noise. 
The  sensing  error  covariance  is  tuned  to  the  detection  error  (see  the  blue  dashed  line  in  Figure  2.5c). 
Additionally,  the  Kalman  filter  is  initialized  with  a  perfect  state  estimate.  In  effect,  the  Kalman 
filter  is  simply  acting  like  a  smoothing  function  for  the  detections.  From  Figure  2.5b,  it  is  clear  that 
the  system  is  extremely  noisy  especially  with  the  false  reports.  Figure  2.5c  shows  a  time  line  of 
the  sensor  network  detection  error  (blue  dashed  line)  and  the  Kalman  filter  estimation  error  (solid 
orange  line)  with  detection  receive  times  (light  blue  vertical  lines).  The  detection  error  is  calculated 
as  the  difference  between  the  robot’s  GPS  measured  position  and  the  sensor  network’s  most  recently 
reported  detection.  Clearly,  this  error  is  quite  high.  Often,  the  robot  is  not  even  detected  within  the 
correct  grid  cell  or  even  the  correct  9  by  9  grid  cell  surrounding  the  robot.  In  fact,  the  mean  error 
of  raw  detections  is  2.42  meters,  indicating  that  the  robot  is  often  detected  in  the  wrong  grid  cell. 
Over  the  course  of  the  145  second  journey,  the  robot  receives  48  detections,  providing  an  average 
of  1  detection  every  3.02  seconds. 

Additionally,  as  we  will  discuss  shortly,  we  estimate  the  mean  latency  of  the  network  to 
be  about  1.75  seconds.  Given  this  high  latency,  low  bandwidth,  and  high  error  system,  our  Kalman 
filter  (with  perfect  initial  state  and  zero  movement  noise)  still  generates  1 .96  meters  mean  position 
error;  the  unprocessed  detections  have  3.25  meters  mean  position  error.  Clearly,  a  real  system 
with  movement  noise,  running  a  feedback  loop  on  such  data  would  be  quite  limited  in  speed  and 
accuracy;  compare  this  with  a  10  Hertz,  2  cm  accuracy  GPS  system,  for  instance. 

To  understand  the  intrinsic  quality  of  the  sensor  network  data,  we  more  aggressively  filter 
the  raw  detections  to  generate  the  results  shown  in  Figure  2.6.  In  particular,  we  remove  reports 
from  (additional)  misbehaving  motes  by  silencing  motes  (12, 12),  (12, 0),  and  (4, 4).  To  account  for 
latency,  we  shift  the  reports  backwards  in  time  and  observe  that  a  time  shift  in  the  range  of  1 .6  to 
2.5  seconds  is  helpful  in  reducing  the  detection  error.  We  observe  that  a  time  shift  of  1.75  seconds 
provides  a  minimum  amount  of  detection  error.  After  which,  we  apply  the  same  Kalman  filter  to 
estimate  the  robot  position.  Our  robot  now  only  receives  41  measurements  (about  1  measurement 
every  3.54  seconds),  but  is  able  to  achieve  a  mean  detection  error  of  1.69  meters  (13.78%  less  error 
than  the  previous  Kalman  filter  estimate). 

Over  the  span  of  this  path,  the  detection  error  averages  2.60  meters,  and  1.53  meters  after 
application  of  the  Kalman  filter.  We  observe  that  even  after  applying  our  (non-causal)  filter  to  hand 
picked  measurements,  the  sensor  platform  still  proves  challenging  for  support  of  high  speed,  high 
accuracy  control  systems.  The  next  section  summarizes  the  lessons  learned  during  this  deployment. 
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(a)  GPS  measured  robot  path  (solid  black)  demarcated  ev-  (b)  GPS  measured  robot  path  (solid  black)  demarcated  ev¬ 
ery  20  seconds  (black  squares)  with  leader  motes  (blue  ery  20  seconds  (black  squares)  with  Kalman  filter  esti- 


stars)  and  received  message  times  (blue  dotted  lines). 


mated  position  (solid  orange). 
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(c)  Position  error:  sensor  network  detected  (dashed  blue)  and  Kalman  filter  estimated  (solid  orange). 


140 


Figure  2.5:  Results  collected  from  a  single  robot  traversing  a  7  by  7  sensor  network.  Faulty  motes 
(4, 10)  and  (4, 12)  have  been  manually  silenced. 
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(a)  GPS  measuied  lobot  path  (solid  black)  demarcated  ev¬ 
ery  20  seconds  (black  squares)  with  leader  motes  (blue 
stars)  and  received  message  times  (blue  dotted  lines). 


(b)  GPS  measured  robot  path  (solid  black)  demarcated  ev¬ 
ery  20  seconds  (black  squares)  with  Kalman  filter  esti¬ 
mated  position  (solid  orange). 
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(c)  Position  error:  sensor  network  detected  (dashed  blue)  and  Kalman  filter  estimated  (solid  orange). 


Figure  2.6:  Results  collected  from  a  single  robot  traversing  a  7  by  7  sensor  network.  Faulty  motes 
((4, 10)  and  (4, 12))  and  noisy  motes  ((12, 12),  (12,0),  and  (4,4))  have  been  manually  silenced. 
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2.4  Lessons 

PEG  taught  us  many  practical  lessons  for  design,  development,  and  deployment  of  a 
medium-scale  sensor  network  and  control  system.  We  learned  that  mote  enclosures  should  be 
tailored  specifically  for  development  or  deployment,  that  every  physical  interaction  with  a  mote 
is  likely  to  cause  damage,  that  a  high  gain  antenna  snooping  on  the  network  is  an  invaluable  de¬ 
bugging  tool,  and  that  many  additional  software  services  for  in  situ  debugging  and  interaction  are 
needed.  However,  here,  our  focus  is  on  using  the  sensor  network  as  a  sensor  for  a  control  system. 

From  a  control  system's  perspective,  perfect  WSN  data  would  be  such  that  the  node  loca¬ 
tions  are  known,  and,  for  all  time,  the  physical  quantity  to  be  sensed  is  known  at  every  node  with  no 
latency.  However,  PEG  demonstrated  our  controller  would,  aperiodically  and  with  a  low  data  rate, 
receive  noisy  measurements  at  loosely  known  regions  of  space-time.  We  categorized  properties  of 
the  PEG  WSN  that  make  traditional  control  techniques  challenging  to  apply: 

•  Sensor  error 

-  sensor  noise 

-  modeling  error 

-  inter-mote  calibration  error 

-  timing  error 

-  localization  error 

•  False  events 

-  spurious  detections 

-  missing  detections 

•  Network  induced  error 

-  aperiodic 

-  low  data  rate 

-  latency 

Sensor  error  refers  to  the  error  in  the  sensing  system  during  detection  of  true  events  (i.e., 
excluding  any  false  positives  or  negatives)  due  to  sensor  noise,  modeling  error,  inter-mote  calibra¬ 
tion  error,  timing  error,  and  localization  error.  Additional  error  is  induced  in  the  system  by  spurious 
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and  missing  events,  caused  by,  for  example,  faulty  hardware  or  environmental  noise.  Finally,  net¬ 
work  characteristics  denote  the  tendency  of  sensor  measurement  packets  to  arrive  aperiodically  at  a 
low  rate  and  with  significant  latency. 

Just  as  other  researchers  [85]  have  discovered  experimentally  the  difficulties  of  sensing 
applications  using  sensor  networks,  PEG  has  done  this  for  control  systems.  This  chapter  has  de¬ 
scribed  the  difficulty  in  using  sensor  network  platforms  to  inform  control  systems.  In  the  next 
chapter,  we  identify  several  techniques  to  overcome  these  difficulties.  Later  chapters  evaluate  the 
performance  of  these  techniques. 
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Chapter  3 

Practical  System  Architecture 


Our  PEG  deployment  demonstrated  that  several  properties  of  sensor  networks  (discussed 
in  Section  2.4)  frustrate  control  system  design.  This  chapter  investigates  techniques  for  overcoming 
these  challenging  properties  and  develops  an  architecture  that  unifies  these  techniques.  The  afore¬ 
mentioned  challenges  can  be  address  in  many  ways  ranging  from  hardware  design  to  algorithm 
parameter  choice.  In  our  work,  the  localization  and  routing  algorithms  along  with  the  mote  hard¬ 
ware  is  assumed  to  be  fixed  and  given.  Additionally,  the  node  density  and  deployment  strategy  is 
also  assumed  to  be  provided  a  priori.  Hence,  our  approach  focuses  on  using  combinations  of  simple 
algorithms  to  improve  the  overall  system  performance.  In  later  chapters,  the  developed  architecture 
is  tested  with  a  series  of  detailed  simulations. 

Furthermore,  as  our  architecture  is  evolved,  we  will  keep  in  mind  that  many  useful  sensor 
network  services  have  already  been  developed.  In  particular,  services  such  as  localization  [93],  time 
synchronization  [34],  data  query  and  aggregation  [60],  and  run-time  configuration  utilities  [69] 
have  been  developed.  Additionally,  other  architectures  for  networked  control  systems  have  been 
studied.  Brooks  [14]  developed  an  architecture  that  allows  for  graceful  degradation  of  a  control 
system  as  connectivity  decreases,  provided  that  actuators  always  receive  input  from  the  lowest  level 
controller,  a  controller  designed  to  implement  basic,  safe  functionality.  However,  the  design  of  such 
a  controller  is  left  to  the  reader.  Our  work  seeks  to  develop  an  architecture  for  SNAC  systems  based 
on  existing  services  and  seeks  to  design  such  a  control  system. 
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3.1  Sensor  Error 

Sensing  error  due  to  noise,  modeling  disparities,  inter-mote  calibration  and  timing  differ¬ 
ences,  and  localization  error  can  be  addressed  in  a  variety  of  ways.  A  solution  to  noise  is  to  obtain 
more  frequent  samples.  However,  the  number  of  samples  is  fundamentally  limited  by  both  the  hard¬ 
ware  and  the  bandwidth  of  the  network,  in  addition  to  any  self-imposed  limits  designed  to  meet 
other  goals  such  as  reduced  power  consumption.  Many  of  theses  requirements  can  be  addressed 
by  efficient  use  of  the  bandwidth.  In  particular,  for  a  network  that  supports  multiple  clients  and 
objectives,  the  networking  layer  can  operate  in  a  multimodal  fashion,  each  mode  tailored  to  each 
objective.  For  instance,  during  PEG,  the  network  could  operate  both  as  a  localize  pursuer  service 
and  a  track  evader  service.  Messages  used  to  localize  the  pursuer  could  be  designed  to  incur  less 
overhead  since  they  require  only  single  hop  communication. 

Additional  error  due  to  modeling  discrepancies,  inter-mote  calibration  differences,  and 
localization  error  can  be  addressed  with  a  space-time  model  of  the  system  variables  and  an  accom¬ 
panying  estimation  technique  (such  as  probabilistic  models  with  maximum  likelihood  estimation 
techniques).  We  refer  to  such  a  model  and  estimation  technique  as  a  neighborhood  sensor  model. 
Not  only  can  such  models  accurately  interpret  sensor  readings,  but  some  researchers  [92]  have  used 
them  to  calibrate  sensors.  Finally,  timing  differences  amongst  motes  induce  additional  error,  and 
can  be  addressed  by  accounting  for  time  discrepancies  in  the  neighborhood  model  or  by  utilizing  a 
coordination  service.  Such  a  service,  possibly  built  on  top  of  a  time  synchronization  service,  would 
ensure  neighboring  motes  sample  their  sensors  at  the  same  time. 

3.2  False  Events 

False  events,  composed  by  spurious  and  missing  readings,  contribute  to  additional  error 
in  the  sensing  platform.  Such  events  can  be  caused  by  broken  or  incorrectly  calibrated  hardware.  In 
this  case,  problematic  motes  must  be  identified  and  action  must  be  taken  to  ensure  these  motes  do 
not  increase  the  overall  system  error.  One  method  of  identifying  a  faulty  mote  is  to  perform  a  status 
query,  a  request  for  the  current  status,  on  a  mote.  A  network-wide  status  query  provides  a  list  of 
motes  that  either  diagnose  themselves  as  faulty  or  are  not  responding  to  radio  messages.  Another 
method  of  determining  the  status  of  motes  is  to  use  a  neighborhood  sensor  model  for  a  group  of 
motes  and  a  known  stimuli  to  determine  via  collaboration  if  some  motes  are  faulty  or  incorrectly 
calibrated.  Spurious  readings,  often  caused  by  environmental  noise  or  faulty  hardware,  can  be 
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identified  using  validation  or  verification  techniques.  Verification  uses  a  hand  shaking  protocol 
between  the  sensor  network  and  a  cooperative  entity  to  ensure  that  only  the  requested  measurements 
are  announced  by  the  mote  network.  For  instance,  a  cooperative  robot  might  periodically  emit  a  low 
power  radio  message  requesting  nearby  motes  to  reply  with  ranging  measurements.  Far  away  motes 
that  reply  to  the  robot  can  be  labeled  as  potentially  faulty.  Sensor  measurements  from  nearby  motes 
can  also  be  validated  using  a  neighborhood  sensor  model  as  previously  discussed. 

Once  groups  of  motes  have  been  identified  as  potentially  problematic,  a  sensor  network 
maintenance  routine  can  be  applied  that  automatically  calibrates,  restarts,  replaces,  or  turns  off  such 
motes.  Automatically  calibrating  a  mote  involves  using  a  neighborhood  sensor  model  to  predict 
the  mote’s  sensor  field  based  on  measurements  from  nearby  motes,  comparing  this  with  the  mote’s 
actual  measurements,  and  adjusting  the  mote’s  calibration  parameters  accordingly.  Restarting  or 
turning  off  a  mote  would  require  issuing  such  a  command  to  the  mote  over  the  radio.  Replacing  the 
mote  would  involve  navigating  to  the  faulty  mote’s  location  and  deploying  new  hardware.  Finally, 
for  cooperative  robots,  hardware  identified  as  faulty  can  be  ignore  and  avoided  by  employing  an 
intelligent  path  planning  routine. 

3.3  Network  Induced  Error 

Properties  of  the  networking  system  such  as  multi-hop  packet  delivery,  broadcast  colli¬ 
sion,  and  shared  bandwidth  induce  detection  error  when  messages  arrive  aperiodically,  with  high 
latency,  and  at  a  low  data  rate.  Aperiodic  arrival  of  detection  messages  is  caused  both  by  the  event 
triggered  nature  of  distributed  entity  detection  and  by  the  stochastically  varying  latency  intrinsic 
in  the  routing  layer.  Detections  can  potentially  be  made  periodic  with  low  jitter  values  by  sensing 
coordination  as  previously  mentioned.  Latency  and  low  data  rates  can  be  addressed  by  optimiz¬ 
ing  communication  for  multiple  control  modes  (as  previously  mentioned),  artificially  slowing  down 
the  dynamics,  or  by  using  a  predictive  controller.  A  predictive  controller  assumes  a  parametric 
model  of  the  dynamics,  and  is  composed  of  a  goal  controller  with  a  state  and  parameter  estimator 
allowing  it  to  operate  in  the  face  of  missing  and  late  measurements.  Finally,  tight  bounds  on  when 
sensor  measurements  occur  are  often  not  provided  (such  as  during  PEG).  This  can  be  addressed 
with  network-wide  time  synchronization,  hand  shaking  when  measurements  should  be  performed, 
or  with  a  neighborhood  routing  model  that  estimates  the  time  of  detections  based  on  characteristics 
of  the  network. 
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Figure  3.1:  Mote  information  flow.  Responding  to  messages  from  the  radio,  the  mode  controller 
selectively  invokes  services.  Services  interact  with  the  environment  by  querying  sensors  and  sending 
messages. 

3.4  Unified  Framework 

This  section  combines  the  previous  design  solutions  into  a  practical  system  architecture 
for  SNAC  systems  overcoming  challenging  sensor  network  properties  such  as  sensor  error,  false 
events,  and  network  induced  error.  The  techniques  identified  above  indicate  that  this  unified  frame¬ 
work  should  implement  a  subset  of  the  following  services: 

•  Predictive  controller 

-  Parametric  model 

-  Goal  controller 

-  State  and  parameter  estimation 

•  Neighborhood  model 

-  Sensing 

-  Routing 

•  Intelligent  path  planning 


•  Multimodal  controller 
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Figure  3.2:  Agent  information  flow.  Using  radio  messages,  mode  information,  and  (optionally)  on¬ 
board  sensors,  the  estimation  service  infers  the  system-wide  state.  Using  additional  messages  and 
state  estimation,  the  mode  controller  invokes  services  to  achieve  the  agent’s  overall  goal.  Services 
interact  with  the  environment  by  sending  messages  and  engaging  actuators. 


-  Networking 

-  Control  strategy 

•  Coordination 

•  Status  query 

•  Time  synchronization 

•  Hand  shaking 

•  Sensor  network  maintenance  routine 

-  Calibrates 

-  Restart 

-  Replace 

-  Shutoff 

Using  this  list  as  a  guide,  we  propose  a  2  part  architecture:  the  mote  architecture  shown 
in  Figure  3.1  and  the  agent  architecture  shown  in  Figure  3.2.  Both  designs  are  similar  with  the 


32 


exception  that  the  mote  does  not  perform  estimation  or  actuation.  The  2  frameworks  work  together 
in  a  loose  client-server  relationship.  The  mote  generally  either  runs  a  fixed  set  of  services  or  se¬ 
lectively  invokes  services  according  to  agent  demand.  Occasionally,  the  mote  may  invoke  services 
autonomously;  for  instance,  a  watchdog  may  invoke  a  mote’s  maintenance  service  and  cause  a  re¬ 
boot. 

The  high  level  flow  of  information  in  the  mote  architecture  is  event-triggered.  Respond¬ 
ing  to  radio  messages,  the  mode  controller  selectively  invokes  various  services.  Various  sensing 
services,  such  as  detection  and  tracking,  send  detection  messages  triggered  by  sensing  events.  Addi¬ 
tionally,  for  a  typical  tracking  application,  the  mode  controller  may  continuously  run  a  time  synchro¬ 
nization  and  tracking  service  while  invoking  other  services  such  as  status  reports  or  maintenance 
when  requested  by  the  agent. 

The  agent  architecture  can  be  implemented  as  a  time-triggered  architecture.  Periodically, 
using  any  recently  received  messages  and  mode  switching  information,  the  agent  actively  estimates 
system-wide  parameters  related  to  sensor  calibration,  routing,  and  robot  dynamics.  The  mode  con¬ 
troller  using  transition  cues  from  the  network  and  a  current  system-wide  state  estimate  selectively 
invokes  various  services.  Invoked  services  influence  the  system’s  behavior  by  sending  messages 
and  controlling  the  actuators.  For  instance,  during  a  PEG  application,  the  mode  controller  may 
continuously  invoke  navigation  and  path  planning  services  to  track  an  evader,  while  selectively  in¬ 
voking  different  networking  modes  and  maintenance  routines  to  balance  the  robot’s  localization  and 
tracking  error. 

Together  these  architectures  enable  the  aforementioned  design  rules.  The  mode  controller 
allows  us  to  switch  between  various  networking  and  control  modes.  The  navigation  controller  and 
path  planning  services  utilizes  the  predictive  controller  and  intelligent  path  planning  concepts.  The 
estimation  services  utilizes  neighborhood  models.  Finally,  various  services  on  both  architectures 
take  advantage  of  techniques  such  as  coordination,  time  synchronization,  status  query,  hand  shaking, 
and  maintenance.  Some  of  the  design  rules  and  correlating  services  are  explored  in  more  detail  in 
later  chapters. 


33 


Chapter  4 

The  Sensor  Network  and  Control 
System  Simulator 


Previous  chapters  have  described  real  world  SNAC  implementations,  the  challenges  these 
systems  present,  and  methods  of  addressing  these  challenges.  This  chapter,  develops  formal  models 
of  the  SNAC  system  and  an  accompanying  simulator.  Furthermore,  this  chapter  performs  several 
simulations.  The  first  set  of  simulations  compare  components  of  the  previously  developed  system 
architecture  with  a  traditional  design.  The  next  set  of  simulations  establishes  the  importance  of  path 
planning  for  robot  navigation  in  sensor  networks. 

4.1  System  Models 

Using  our  experience  from  PEG  and  results  from  the  literature  [37,  11],  a  set  of  models 
and  a  simulator  for  a  mobile  robot  embedded  within  a  sensor  network  is  developed.  In  order  to 
accurately  capture  the  most  challenging  aspects  of  sensor  networks,  models  are  developed  to  ac¬ 
count  for  sensing  noise,  sensor  saturation,  calibration  differences,  packet  collision,  radio  reception 
range,  multi-hop  latency,  finite  battery  lifetime,  faulty  hardware,  and  loose  time  synchronization. 
Additionally,  the  robot  model  captures  the  actuator  noise  and  nonholonomic  dynamics.  In  the  fol¬ 
lowing  sections,  models  for  the  sensor,  the  communication  layer,  the  mote  hardware,  the  detection 
and  aggregation  algorithm,  and  the  robot  dynamics  are  developed. 
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Figure  4.1:  Sensor  state  transition  diagram. 


4.1.1  Sensing 

For  PEG,  the  motes  used  on-board  magnetometers  to  detect  entities  in  the  network.  Our 
experience  with  the  mote  hardware  indicates  that  the  measured  value  reported  by  a  mote  is  a  function 
of  the  state  of  the  sensor  and  the  magnetic  field  near  the  mote.  In  particular,  each  sensor  could  be 
in  any  one  of  5  states:  normal,  noisy,  unresponsive,  unpredictable,  or  saturated.  A  sensor  in  a 
normal  state  reports  a  value  B  =  Bo  +  e  where  Bo  is  the  magnitude  of  the  magnetic  field  and  e  is 
normally  distributed  noise.  A  noisy  sensor  reports  the  same  measured  value,  but  typically  has  much 
higher  noise  variance  due  to  a  priori  exposure  to  intensive  magnetic  fields  or  faulty  hardware.  An 
unresponsive  sensor  never  reports  measurements  due  to,  for  instance,  a  weak  battery  or  a  faulty 
radio.  Sensors  in  a  unpredictable  state  report  a  normally  distributed  value.  This  state  models  faulty 
sensors  that  tend  to  report  only  noise  typically  due  to  exposure  to  intensive  magnetic  fields.  Finally, 
a  sensor  in  the  saturated  state  reports  a  constant  measured  value  B  =  Bsat.  Sensors  enter  saturation 
when  exposed  to  large  magnetic  fields  and  typically  resume  normal  operation  after  a  short  period  of 
time. 

During  an  experiment,  a  sensor  can  occasionally  transition  between  states  as  shown  in 
Figure  4.1.  Before  an  experiment  begins,  all  sensors  are  in  the  initial  state  of  deploy.  As  each 
mote  is  placed  in  the  field,  its  sensor  transitions  to  the  unpredictable,  unresponsive,  noisy,  or  normal 
state  with  probability  pup,  pur,  pn,  or  1  -  (pup  +  pur  +  pn),  respectively.  Once  a  sensor  becomes 
unresponsive  or  unpredictable,  it  remains  this  way  (until  maintained  or  re-deployed).  Sensors  that 
are  deployed  in  the  normal  or  noisy  states  occasionally  transition  to  the  saturation  state  during 
normal  operation  if  the  detected  field  B  exceeds  the  saturation  threshold  B,i,res.  Once  the  sensor  has 
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been  in  saturation  for  a  duration  of  Tsat  seconds,  it  returns  to  its  previous  state.  Furthermore,  if  a 

mote  is  deployed  in  the  normal  (noisy)  state,  its  error  bias  p^  is  normally  distributed  as  W(0,  cr2) 

(MO,  fr2))  and  its  error  variance  cr2  is  normally  distributed  as  MO,  cr2)  (MO,  cr2 )). 

The  actual  field  the  sensor  is  exposed  to  (denoted  by  Bo)  is  modeled  as  a  magnetic  dipole 

far  field.  In  particular,  each  detectable  entity  is  assumed  to  be  a  magnetic  dipole  with  dipole  moment 
iT 


m  = 


0  0  M 


at  a  height  of  dm  (in  meters)  above  the  field,  where  M  >  0  (in  Am2)  and  dm  is 


much  larger  than  the  length  of  the  dipole.  Letting  r  be  the  vector  from  the  dipole  to  a  particular 
mote,  the  magnitude  of  the  magnetic  field  (in  Teslas)  at  the  mote  is 
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where  hq  is  the  permeability  of  free  space  (in  Wb/(Am)).  Therefore,  with  an  entity  at 
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,  it  can  be  shown  that 
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where  p  —  xm  -  xv  ym  -  yv  is  the  distance  (in  meters)  in  the  plane  between  the  mote  and  the 
entity.  Note,  the  largest  field,  attained  when  the  entity  is  directly  on  top  of  the  mote,  is 

poM 


Bq(p  =  0)  = 
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(4.3) 


4.1.2  Communication 


The  communication  model  abstracts  the  combined  interaction  of  the  lossy  radio  channel 
with  the  multi-hop  routing  protocols.  In  particular,  2  communication  scenarios  are  considered: 
message  broadcasting  (single-hop  communication)  and  multi-hop  routing.  The  broadcast  com¬ 
munication  model  accounts  for  medium  access  control  (MAC)  delays,  packet  collision,  and  non- 
deterministic  reception  ranges.  In  particular-,  when  a  mote  sends  a  broadcast  message,  the  packet 
is  first  tested  for  collision.  With  probability  Pdrop ,  the  message  collides  and  is  lost.  If  not  lost,  the 
message  is  assigned  a  communication  delay  of  Tiag  ~  (U\t']u^  rj'  ].  After  r\ag  seconds  have  elapsed, 
the  message  is  delivered  to  each  neighbor  at  distance  d  with  probability  pdist(d).  The  reception 
probability  is  modeled  after  the  work  by  Ganesan  et  al  [37]  and  Cerpa  et  al  [19].  In  particular,  this 
work  determined  that  the  reception  probability  is  not  1  even  at  a  distance  0  and  does  not  drop  to  0 
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Figure  4.2:  Single-hop  reception  probability  pcnst(d)  =  K(d,p.  cr)  for  p  =  2.5  and  cr  =  1.5. 


even  for  rather  large  distances.  This  reception  probability  pdist(d)  is  approximated  here  with 


Pdist(d)  =  cr)  =  0.5  1  -  erf 


d  -  p 

vT 


O" 


(4.4) 


for  some  fixed  values  p,cr.  An  example  of  this  function  for  p  =  2.5  and  cr  =  1.5  is  shown  in 
Figure  4.2. 

The  multi-hop  routing  model  accounts  for  packet  loss  and  lag  induced  by  each  hop.  This 
model  assumes  the  underlying  routing  algorithm  ensures  a  robust  network  topology  that  provides 
a  route  between  any  sender  and  receiver  pair.  Hence,  each  single-hop  link  along  a  multi-hop  route 
is  assumed  to  be  reliable  and  the  reception  probability  model  pdistid)  is  ignored.  However,  since 
each  transmission  may  still  be  loss  due  to  collision,  each  hop  may  fail  with  probability  Pdrop ■  Fur¬ 
thermore,  it  is  assumed  that  the  number  of  hops  between  any  sender  and  receiver  pair  can  be  ap¬ 
proximated  by  h}wp  =  r-p-1  where  cl,  is  the  straight  line  distance  between  the  sender  and  receiver 

“ hop 


and  d]Wp  is  the  estimated  average  distance  per  hop.  Using  this  model,  each  multi-hop  message  is 
first  duplicated  for  each  potential  receiver.  Then,  for  each  message,  the  straight  line  distance  to  the 
receiver  dr  and  the  estimated  hop  count  h/lop  is  computed.  Then,  each  hop  i  e  {1,2,...,  h/lop}  either 
fails  with  probability  Pdrop  and  the  message  is  lost,  or  the  hop  succeeds  and  adds  an  additional  lag 
of  rj  ~  1 Y[tJ  ,  rj'  ]  to  the  overall  packet  latency  Tiag  =  2/  T)a,y  If  all  hops  succeed,  the  message 
is  delivered  T/ag  seconds  later  to  its  receiver. 
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4.1.3  Mote  Platform 

The  mote  platform  model  accounts  for  the  variation  of  times  motes  come  online  and  each 
mote’s  finite  battery  lifetime.  In  particular,  motes  may  be  manually  switched  on  one  at  a  time  or  the 
network  may  be  sent  a  start-up  radio  message.  In  either  case,  individual  motes  will  become  active 
at  different  times.  This  is  modeled  by  allowing  the  initial  sensing  time  ts  of  a  mote  be  uniformly 
distributed:  ts  ~  U\  r'.,  r"].  Furthermore,  once  a  mote  is  turned  on,  it’s  battery  (or  potentially  other 
vital  hardware)  will  eventually  fail.  This  is  modeled  by  setting  a  uniformly  distributed  expiration 
time  Te  for  each  mote:  re  ~  'ZYfr' ,  t”\. 

4.1.4  Detection  Routine 

The  detection  routine  models  the  entity  detection  and  aggregate  algorithm  that  is  pro¬ 
grammed  on  the  motes.  This  models  the  same  algorithm  implemented  by  PEG.  In  particular,  each 
mote  with  a  measured  magnetic  field  B  larger  than  a  given  threshold  Bthres  announces  the  measured 
value  over  the  radio  to  its  neighboring  motes  using  a  detection  message.  A  detection  message  con¬ 
tains  the  announcing  mote’s  id,  location,  and  the  measured  magnetic  field.  Once  a  mote  announces 
a  detection,  it  must  wait  for  at  least  Treport  seconds  before  reporting  a  detection  again.  Additionally, 
any  mote  receiving  a  detection  message  will  store  the  message  for  Treport  seconds  before  discarding 
it.  If  a  detecting  mote  measures  a  magnetic  field  larger  than  the  magnetic  field  reported  by  any  of 
its  neighbors  within  the  last  Treport  seconds,  this  mote  will  elect  itself  as  a  leader.  As  a  leader,  it 
aggregates  all  the  detection  reports  from  its  neighbors  (within  the  last  Treport  seconds)  into  an  event 
packet  and  sends  this  packet  (via  the  multi-hop  routing  layer)  to  the  pursuer  robot. 


4.1.5  Robot 


The  robot  model  accounts  for  the  kinematics  of  the  pursuer  and  evader  robots.  For  PEG, 
the  pioneer  robot  from  ActivMedia  was  used.  However,  for  many  other  test  beds,  the  COTS- 
BOTS  [11]  platform,  a  mote  controlled  small  RC  car,  is  used.  The  robot  model  approximates 
the  nonholonomic,  car-like  kinematics  of  the  COTS-BOTS  that  can  virtually  instantaneously  set  its 

lT 


steering  angle  and  wheel  velocity.  In  particular,  the  position  and  orientation 
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Figure  4.3:  The  robot  model’s  state  and  parameters. 
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where  b  is  the  wheelbase,  i]k  is  the  change  in  orientation,  T  is  the  sampling  period,  vk  is  the  wheel 
velocity,  and  cpk  is  the  steering  angle  as  shown  in  Figure  4.3.  Additionally,  the  achievable  values  for 
wheel  velocity  and  steering  angle  are  limited  by  the  hardware: 


Vk  ^  \_V min  •>  Vmax] 

(4.8) 

fik  ^  [0mm  •>  tymax\ 

(4.9) 

(4.10) 

where  vm-in  <  0  <  vmax  and  <  0  <  <pmax-  The  applied  wheel  speed  and  steering  angle  are  noisy 
functions  of  the  user  supplied  control  values  vk  and  <f>k.  In  particular, 


Vk 

=  vk  +  evk 

(4.11) 

4>k 

+ 

II 

(4.12) 

4 

~  AC  (0,  cr2) 

(4.13) 

4 

~  H(0,o*) 

(4.14) 
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Finally,  the  control  bounds  {vmin,vmax,<pmin,(pmax}  are  considered  fixed,  unknown  param¬ 
eters  of  the  system. 

4.2  Agent  Design 

This  section  develops  several  of  the  system  architecture  components  from  Section  3.4 
including  neighborhood  estimation  services,  a  predictive  controller,  and  a  path  planner.  First,  an 
extended  Kalman  filter  is  developed  for  state  estimation.  Then,  two  modifications  to  the  estimation 
system  are  proposed:  network  latency  compensation  and  faulty  node  filtering.  Next,  a  predictive 
navigation  controller  is  designed.  Finally,  a  feedback  controller  for  navigation  is  developed  to  com¬ 
pare  with  the  predictive  controller. 

4.2.1  State  Estimation 

In  this  section,  an  extended  Kalman  filter  (EKF)  [6]  is  designed  to  provide  state  estimates 
to  the  controller.  In  following  sections,  the  estimator  is  augmented  to  compensate  for  false  posi¬ 
tives  and  communication  delay.  An  EKF  is  designed  analogously  to  a  Kalman  filter  by  linearizing 
nonlinearities  in  the  dynamics  at  each  point  of  estimation.  For  each  new  time  step  k  +  1  and  new 
sensor  measurement  y^+i ,  the  EKF  iteratively  computes  the  new  state  estimate  x^+i^+i  and  the  error 
covariance  Pk+\\k+\-  The  update  occurs  iteratively  in  2  steps:  a  proprioceptive  update  and  a  precep¬ 
tive  update.  The  proprioceptive  update  evolves  the  state  forward  to  realize  xk+\\k  according  to  the 
system  dynamics  with  no  noise,  and  evolves  the  error  covariance  forward  according  to 


Pk+l\k  =  FkPk\kFl  +  Qk 


(4.15) 


where  Qk  is  the  noise  covariance  of  the  dynamics  and  Fk  is  the  Jacobian  of  the  dynamics.  For  the 
car  dynamics,  it  can  be  shown  that 


1  0  -VkT  sin(%) 


Fk=  0  1  VkT  cos(6k) 

0  0  1 


(4.16) 


for  a  car  moving  in  a  straight  line,  or 


1  0  b  cot(<^t)[cos 0k\k  +  m )  -  cos(%)] 
Fk  =  0  1  b  cot(0*)[sin(%  +  rjk)  -  sin (%)] 


(4.17) 


0  0 


1 
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for  a  car  that  is  turning  where  %  =  v<Ts“1(l/>t). 

The  preceptive  update  is  computed,  using  the  proprioceptive  update,  as 


74+1 

=  Pk+i\kHl+l(Hic+iPk+i\kHl+l  +74+i)  1 

(4.18) 

4+l|fc+l 

=  4+l|Jt  +  74+i04:+i  -  T4+1.%+1|*) 

(4.19) 

74+i|*+t 

=  (7  -  74+iT4+i)P*+i|* 

(4.20) 

where  74+ 1  is  the  covariance  of  the  sensor  noise  and  77*+ 1  is  the  Jacobian  of  the  measurement  func¬ 
tion.  Since  event  messages  can  be  transformed  into  a  single  position  estimate  (using,  for  instance,  a 
center  of  mass  computation),  the  measurement  function  is  taken  to  be 


yk 


1  o  o 
0  1  0 


Ok 


+  e 


(4.21) 


where  e  is  the  sensor  noise  which  is  normally  distributed  with  covariance  74-  Hence,  for  this  model, 
74+i  is  given  by  the  constant  output  matrix: 


74+ 1 


1  0  0 
0  1  0 


(4.22) 


4.2.2  Network  Latency  Compensation 

Using  the  above  computations,  the  state  can  be  estimated  for  each  time  step  using  knowl¬ 
edge  of  the  control  values  and  the  new  sensor  measurements.  However,  the  sensor  network  does 
not  provide  a  sensor  measurement  at  each  time  step.  Furthermore,  received  measurements  can  be 
out  of  order  or  late.  Hence,  the  input  to  the  EKF  is  adapted  to  account  for  these  characteristics.  If  a 
new  sensor  measurement  is  not  available,  only  the  proprioceptive  update  is  applied.  If  a  new  sensor 
measurement  is  available  it  is  used  to  correct  the  state  estimate  t,j  seconds  in  the  past  where  tci  is 
the  estimated  mean  latency  of  received  messages.  After  a  past  state  estimate  is  corrected,  the  filter 
is  re-applied  to  generate  a  current  estimate. 


4.2.3  Faulty  Node  Filtering 

To  account  for  faulty  motes  that  frequently  report  false  positives,  a  message  validation 
filter  is  applied  to  all  incoming  messages.  This  filter  estimates  a  packet’s  validity  probabilistically 
as  p{v\n)  where  n  is  the  number  of  sensor  readings  in  the  packet  and  v  e  {valid,  invalid)  is  the 
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packet’s  validity.  Applying  maximum  likelihood  estimation,  the  validity  of  a  packet  with  h  sensor 
readings  is  the  argmax  over  v  of  p{n\v)  assuming  a  uniform  prior  distribution  for  v.  Clearly,  this 
model  either  accepts  or  rejects  packets  based  on  the  number  of  sensor  readings. 


4.2.4  Predictive  Controller 


A  model  predictive  controller  [82]  (MPC)  is  developed  for  way-point  navigation  of  the 
car-like  robot.  This  controller  achieves  the  destination  point  Xf  using  two  maneuvers:  turning  and 
driving  straight.  First,  during  the  turning  phase,  the  vehicle  is  driven  in  a  tight  circle  taking  the 


vehicle  from  a  starting  state 


6n 


phase,  the  vehicle  is  driven  straight  (cf>  =  0)  to 


to  a  intermediate  state 

T 

Xf  *' 


-iT 


xb  Gb 


.  Second,  during  the  straight 


where  the  orientation  is  not  specified. 


The  computation  is  carried  out  by  first  determining  the  quadrant  ql  (relative  to  the  robot 
coordinate  system)  the  destination  point  lies  in.  Next,  it  is  determined  whether  the  destination  point 
lies  inside  the  turning  radius  of  the  robot  at  its  initial  position  (r1  =  true)  or  not  (r1  =  false).  Using 
these  two  computations,  the  controller  chooses  an  initial  control  value: 

•  (v,  (f>)  =  (vmax,  (pmax )  if  e  {(2,  false),  (4,  true)} 

*  0^0)  —  (ymini&max)  if  (ql,rl)  e  {(3, false), (1, true)] 


•  (v,  0)  =  (vmax, <pmin )  if  (ql,rl)  e  {(1,  false),  (3,  true)} 

•  (v,  ((>)  =  (vmin,  (pmin )  if  (ql ,  r1)  e  {(4,  false),  (2,  true)} 


This  control  action  is  applied  exactly  long  enough  for  the  robot  to  reach  an  intermediate  state  that 
admits  a  straight  path  to  the  destination.  Next,  a  control  action  that  drives  the  robot  straight  to  the 
destination  is  computed  by  determining  which  quadrant  q2  the  destination  currently  lies  in: 

•  (v,  (f>)  =  (vmax,  0)  if  q2  6  {1, 2} 

•  (v,  (f>)  =  (ymin,  0)  if  q2  e  (3, 4} 


The  second  control  action  is  applied  exactly  long  enough  for  the  robot  to  reach  the  destination. 


4.2.5  Feedback  Controller 

A  simple  state  feedback  controller  is  developed  to  challenge  the  performance  of  the  pre¬ 
dictive  controller.  To  simplify  future  analysis,  the  feedback  controller  is  kept  quite  simple:  way- 
points  are  assumed  to  be  far  away  (in  comparison  to  the  wheelbase)  and  way-points  are  considered 
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to  be  successfully  achieved  when  the  car  is  within  dgoai  >  b.  Given  these  constraints,  the  feedback 
controller  is  designed  to  proportionally  track  the  desired  orientation  (the  robot  orientated  towards 
the  goal)  and  reduce  the  wheel  speed  as  the  destination  becomes  close.  In  particular,  the  control 
values  are 

v,.l  [v . T 1  —  p-\\xk-Xf\\~\ 

(4.23) 


is  the  state  estimate  at  time  k,  vmax  is 
e,  and  p(/,  e  R+  is  a  control  parameter. 


Vk 

W 1  - 

<Pk 

P$k  -  <) 

where  ff!  =  arctan(  '  .,)  is  the  desired  orientation,  xk  9k 

K  xf  xk 

a  fixed  a  priori  estimate  of  the  maximum  speed  of  the  vehic 


4.3  Controller  Comparison  Simulations 

Using  the  aforementioned  models  and  control  systems,  several  simulations  are  performed 
to  compare  the  performance  of  the  control  architecture  from  Section  3.4  to  that  of  a  simple  feedback 
controller.  In  particular,  the  estimation  accuracy  and  navigation  ability  of  five  different  system 
architectures  is  compared.  The  simulator  is  configured  to  emulate  the  PEG  sensor  network.  In 
particular,  the  parameter  values  used  are  given  in  Appendix  A  unless  otherwise  noted. 

Five  system  architectures  are  compared.  System  1  uses  the  feedback  controller  and  the 
plain  EKF  previous  designed.  The  plain  EKF  estimator  does  not  use  network  latency  compensation 
or  faulty  node  filtering.  System  2  augments  the  estimator  of  System  1  with  faulty  node  filtering. 
System  3  augments  the  estimator  of  System  2  with  network  latency  compensation.  System  4  uses 
the  model  predicative  controller,  the  EKF,  and  faulty  node  filtering.  Finally,  system  5  augments  the 
estimation  of  system  4  with  network  latency  compensation. 


iT 


10  10 


,  and 


For  each  simulation,  10  trials  were  ran,  the  destination  goal  was  set  to 
the  goal  tolerance  was  set  to  10  cm.  Each  trial  was  ran  until  the  goal  was  achieved  or  150  seconds 
elapsed.  An  example  trial  for  each  system  is  shown  in  Figure  4.4.  For  each  figure,  the  gray  sym¬ 
bols  represent  the  locations  of  sensor  nodes  with  each  symbol  denoting  the  node’s  status:  normal 
(gray  dot),  non-re sponsive  (gray  x),  noisy  (gray  asterisk),  and  unpredictable  (gray  pentagram).  Fur¬ 
thermore,  the  robot's  starting  and  ending  states  are  denoted  by  a  green  rectangle  and  arrow.  The 
desired  ending  location  is  drawn  as  3  concentric  black  circles.  Finally,  the  estimated  robot  position 
during  the  trial  is  denoted  by  light  red  dots  connected  by  a  light  red  dotted  line.  Each  smaller  fig¬ 
ure  is  captioned  with  the  system  architecture  components  used:  Feedback  (FB),  extended  Kalman 
filter  (EKF),  model  predictive  controller  (MPC),  faulty  node  filtering  (FNF),  and  network  latency 
compensation  (NFC). 
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(a)  System  1:  FB,  EKF 


20 


(b)  System  2:  FB,  EKF,  FNF 


(c)  System  3:  FB,  EKF,  FNF,  NLC 


(d)  System  4:  MPC,  EKF,  FNF 


(e)  System  5:  MPC,  EKF,  FNF,  NLC 


Figure  4.4:  Example  trial  for  5  different  control  architectures  depicting  path  and  estimation  results. 
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System 

ftgoal 

tgoai 

l^est 

ex 

eg 

1 

0 

NA 

5001.0 

2.98 

1.95 

2 

3 

81.1 

4311.5 

1.21 

1.51 

3 

1 

66.1 

4721.2 

1.82 

2.01 

4 

2 

108.5 

8.6 

1.15 

0.56 

5 

4 

111.7 

8.4 

1.14 

0.54 

Table  4.1:  Simulation  results  comparing  5  different  system  architectures  for  10  trials.  The  first  3 
system  utilize  a  feedback  architecture  and  the  last  2  utilize  a  MPC  based  architecture.  Shown  above 
is  the  number  of  goal  achieving  trials  ngoai,  the  mean  destination  travel  time  tgoai,  the  mean  number 
of  estimations  required  hest,  the  mean  estimated  position  error  ex,  and  mean  estimated  orientation 
error  eg. 


Referring  to  this  figure,  it  is  shown  that  for  system  1,  the  estimated  path  is  smeared  to¬ 
wards  the  unpredictable  mote  in  the  lower  right  of  the  field.  This  is  expected  since  the  estimator 
does  not  account  for  such  faulty  nodes.  Consequently,  the  estimation  is  poor  and  the  feedback  con¬ 
troller  drives  the  robot  aimlessly  in  circles.  The  next  system,  utilizes  the  faulty  hardware  model  and 
does  much  better.  The  estimator  is  able  to  ignore  the  faulty  node  and  is  able  to  reasonably  esti¬ 
mate  the  position  of  the  robot  guiding  it  close  to  the  desired  destination.  System  3  is  an  attempt  to 
compensate  for  the  lag  of  the  network.  For  this  trial,  the  system  promptly  achieves  the  desired  goal 
position,  ffowever,  as  will  be  addressed  later,  this  turns  out  to  not  be  true  in  general.  The  estimation 
tends  to  lags  sufficiently  behind  the  the  robot’s  actual  position  and  causes  instability  in  the  feedback 
system.  The  next  system  switches  to  the  model  predictive  controller  with  the  EKF  and  faulty  node 
filtering.  This  system  achieves  the  goal  with  low  estimation  error.  In  particular,  since  this  system 
only  estimates  the  robot’s  state  after  an  entire  control  sequence  has  been  carried  out,  the  controller 
can  pause  and  allow  laggy  sensor  messages  to  reach  it  before  each  estimation.  Finally,  system  5, 
augments  the  previous  system  with  network  latency  compensation  further  reducing  the  estimation 
error.  This  system  does  not  suffer  with  the  application  of  network  latency  compensation  as  system 
3  does  since  the  MPC  does  not  immediately  react  to  each  incoming  message  and  can  pause  at  the 
completion  of  each  control  sequence. 

The  simulation  results  are  summarized  in  Table  4.1.  Notice,  the  basic  feedback  system  is 
incapable  of  ever  achieving  the  goal.  System  2  achieves  a  significant  improvement  in  performance 
by  using  faulty  node  filtering.  Flowever,  as  previously  mentioned,  network  latency  compensation 
tends  to  induce  instability  for  system  3  and  reduces  the  overall  performance  of  the  controller.  The 
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Path 

td 

West 

ex 

ee 

ftdet 

ftevt 

&evt,pos 

1 

60.5 

i 

3.90 

0.47 

1328.0 

229.1 

3.56 

2 

62.9 

2 

2.77 

1.73 

1186.2 

230.2 

3.82 

3 

65.1 

2 

2.10 

0.53 

1604.8 

266.3 

3.37 

4 

109.6 

8 

1.71 

0.78 

2199.0 

375.8 

4.40 

Table  4.2:  Simulation  results  comparing  4  different  paths  across  20  trials.  Shown  above  is  the  travel 
time  td,  the  mean  number  of  estimations  required  nest,  the  mean  estimated  position  error  ex,  the 
mean  estimated  orientation  error  eg,  the  mean  number  of  detections  n,iei~  the  mean  number  of  event 
messages  sent  hevt,  and  the  mean  estimated  position  error  without  filtering  eevtrPOS. 


last  2  systems  improve  performance  by  replacing  the  feedback  controller  with  the  MPC.  In  particu¬ 
lar,  system  5  achieves  the  best  results.  This  architecture  achieves  the  goal  40%  of  the  time,  requires 
very  little  estimation  overhead,  and  reduces  the  position  and  estimation  error  by  61.7%  and  72.3%, 
respectively,  compared  to  the  basic  feedback  system. 


4.4  Path  Comparison  Simulations 


Using  the  aforementioned  models  and  system  architecture,  several  simulations  are  per¬ 
formed  to  compare  the  effect  path  planning  has  on  robot  localization  accuracy.  In  particular,  using 
the  final  system  architecture  (system  5)  from  the  previous  section,  four  different  routes  to  the  desti¬ 
nation  are  followed.  The  results  are  compared  to  determine  if  path  planning  (exploiting  the  sensor 
network  topology)  significantly  effects  localization  accuracy.  The  simulator  is  configured  to  em¬ 
ulate  the  PEG  sensor  network.  In  particular,  the  parameter  values  used  are  given  in  Appendix  A 
unless  otherwise  noted. 

iT 

For  these  simulations,  20  trials  are  performed  and  the  destination  goal  was  set  to  1 5  15  - 

Example  trials  using  the  four  chosen  routes  are  shown  in  Figure  4.5.  Path  1  simply  goes  directly 
to  the  destination  ignoring  the  sensor  network  topology.  Path  2  is  allowed  to  slightly  deviate  from 
straight  to  encounter  a  region  more  densely  covered  by  nodes.  Path  3  is  allowed  the  same  flexibility, 
but  takes  a  different  path.  Finally,  path  4  attempts  to  enter  as  many  densely  covered  areas  as  possible 
on  the  way  to  the  destination. 

The  results  are  summarized  in  Table  4.2.  As  expected,  path  1  is  the  quickest  path  to 
the  destination  and  the  control  architecture  only  estimates  the  robot  state  once  the  destination  is 
reached.  Since  nodes  are  scattered  to  either  sides  of  this  route,  the  estimated  path  has  high  error,  but 
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(c)  Path  3 


(d)  Path  4 


Figure  4.5:  Example  trial  for  4  different  fixed  routes  depicting  path  and  estimation  results. 
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the  estimated  orientation  has  low  error.  The  second  path  does  well  initially  (see  Figure  4.5b),  but  the 
latter  sections  of  this  path  are  detected  by  nodes  to  the  lower  right  of  the  actual  position.  This  results 
in  a  reduction  in  estimated  position  error,  but  an  increase  in  estimated  orientation  error.  The  third 
path  mends  the  trouble  with  the  latter  part  of  the  second  path  by  routing  down  and  to  the  right  (see 
Figure  4.5c).  This  has  the  effect  of  slightly  increasing  the  number  of  detections  and  achieves  good 
estimation  results.  Path  4  attempts  to  only  navigate  in  dense  regions  and  seeks  reduced  error.  This 
route  does  reduce  the  estimated  position  error  at  the  sake  of  increased  estimated  orientation  error 
and  increased  number  of  detections  (utilization  of  bandwidth).  In  fact,  this  path,  although  chosen 
to  reduce  the  innate  sensor  error  eevtiPOS,  has  actually  increased  it  by  23.6%  over  the  basic  straight 
path.  This  can  also  been  seen  by  Figure  4.5d.  The  first  half  of  the  route  has  high  detection  error 
with  motes  detecting  the  robot  either  too  high  or  too  low.  However,  even  with  poor  detections,  this 
path  achieves  the  lowest  position  estimation  error.  From  this,  we  see  that  it  is  unclear  how  the  robot 
should  choose  its  path.  This  will  be  addressed  in  more  detail  in  later  chapters. 

The  simulations  have  confirmed  that  both  control  architecture  and  path  planning  have 
significant  effect  on  the  system  performance.  Components  of  the  unified  system  architecture  have 
proved  to  be  beneficial  for  SNAC  system  operating  in  realistic  simulations.  However,  it  is  unclear 
how  path  planning  should  be  addressed  to  improve  system  performance.  In  the  following  chapters, 
path  planning  to  achieve  reduced  localization  error  and  bandwidth  utilization  is  investigated. 
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Chapter  5 

Information  Maps 


The  remaining  material  in  this  thesis  focuses  on  robot  localization  in  sensor  networks 
exploiting  an  information  metric.  This  chapter  presents  information  maps,  a  tool  for  determining 
the  localizability  of  a  robot  in  a  region.  Previous  work  [78,  63,  86]  exploiting  information  for  robot 
localization  has  been  in  the  context  of  a  robot  equipped  with  on-board  sensors.  Our  work  applies 
a  global  information  view,  or  information  map,  to  robots  localizing  with  sensor  network  data.  In 
the  following  chapters,  we  develop  sensor  network  models  and  approximations  of  these  models 
suitable  for  information  map  computation.  For  instance,  this  chapter  considers  many  binary  sensors 
distributed  in  a  field;  later  chapters  extend  this  sensor  model  and  develop  approximations  suitable 
for  efficient  computation.  Additionally,  our  work  provides  a  comprehensive  look  at  information 
maps  (formal  presentation  of  the  algorithm  with  time  complexity  analysis),  and  realistic  sensor 
network  localization  simulations  utilizing  information  maps.  Finally,  a  later  chapter  will  develop  a 
novel  path  planning  routine  exploiting  information  that  outperforms  several  other  (non-information 
based)  techniques  for  accurately  localizing  a  robot  in  a  sensor  network. 

This  chapter  provides  an  overview  of  information  maps.  First,  an  analytical  overview 
of  Markov  localization  is  presented.  Next,  the  information  metric  is  introduced.  Then,  using  this 
metric,  an  algorithm  is  presented  that  computes  information  for  all  regions  of  the  robot  pose  space 
generating  an  information  map.  Finally,  several  simulations  are  performed  revealing  the  computa¬ 
tion  time  required  and  providing  insight  into  the  information  topology  of  a  sensor  network. 
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5.1  Markov  Localization 

This  section  describes  how  Markov  localization  [36]  is  used  to  estimate  the  pose,  or  state, 
of  a  robot.  First,  the  system  model,  composed  of  the  robot’s  dynamics  and  the  sensor  network 
model  is  described.  Next,  this  model  is  adapted  for  use  with  Markov  localization.  Finally,  the  steps 
of  localizing  a  robot  using  Markov  localization  are  outlined. 

Markov  localization  is  a  Bayesian  estimation  technique  [49]  for  estimating  a  robot’s  pose 
given  a  model  of  the  system  and  periodic  sensor  readings.  Markov  localization  requires  that  the  pose 
belongs  to  a  finite  space  and  that  the  system  is  specified  probabilistically  using  a  Flidden  Markov 
Model  (HMM).  However,  a  robot’s  pose  often  lives  in  an  infinite,  continuous  space  and  evolves 
according  to  a  set  of  difference  or  differential  equations.  In  order  to  rectify  this  difference,  the 
infinite  pose  space  is  partitioned  and  the  dynamics  are  converted  to  a  HMM. 

First,  the  infinite  pose  space  is  partitioned.  In  the  following,  a  bar  is  placed  over  variables 
related  to  the  infinite  pose  space,  such  as  l  or  £,  to  distinguish  them  from  the  finite  pose  space,  such 
as  /  or  £.  The  infinite  pose  space  £  is  partitioned  into  a  finite  number  of  pose  spaces  £  =  {.£,•}. 
Typically,  this  partitioning  is  based  on  a  regular  grid,  although  other  techniques  such  as  topologi¬ 
cal  [68]  and  tree-based  [16]  are  used.  For  our  work,  the  continuous  pose  space  is  a  A  by  A  square 
anchored  at  the  origin  in  R2.  This  space  is  partitioned  into  a  regular  M  by  M  grid  where  the  (;,  j)lh 
partition,  or  cell,  is  denoted  as  Q,j.  More  precisely,  the  pose  spaces  are  described  by 


£  = 

[0,  A]  x  [0,  A]  c  R2 

(5.1) 

£  = 

(5.2) 

Qi.j  - 

\(i  -  1)A,  ;A|  x  [(j  -  1)A,  jA\  c£cR2 

(5.3) 

A  = 

A 

M 

(5.4) 

where  i,  j  e  [1,2,...,  M).  For  notational  convenience,  (/,  j)  is  used  to  denote  a  pose  /  in  the  partition 
space  that  represents  the  ( i ,  j)th  grid  cell  Qij  with  center  gjj,  inducing  the  following  relationship: 

I  =  ( i,j )  =  Quj  1  e  Qij  <=>  gij  =  Center (§Lj)  (5.5) 

Next,  the  system  dynamics  are  adapted  for  Markov  localization.  The  robot’s  pose  /  e  £ 
is  assumed  to  evolve  according  to  the  difference  equation 


4+1  =  f(h,Uk,k) 


(5.6) 
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Figure  5.1:  A  hidden  Markov  model. 

for  each  time  step  k  e  {0, 1,2, . . .},  and  each  control  value  uk  £  U.  At  each  time  step  k,  our  sensor 
network  measures  the  current  pose  4  as 

yk  =  h(lk,  k)  (5.7) 


where  yk  e  J/. 

Adapting  these  dynamics  to  a  HMM  can  be  challenging.  In  a  later  section,  the  necessary 
steps  are  performed;  for  the  time  being,  it  is  assumed  that  the  system  is  already  adapted  to  a  HMM. 
The  HMM  (see  Figure  5.1)  is  modeled  with  a  hidden  state  lk  e  £,  representing  the  pose,  an  output 
yk  representing  the  sensor  network  measurements,  and  a  transition  input  uk  representing  the  control. 
The  relationships  between  the  variables  are  specified  probabilistically:  the  evolution  from  one  state 
4  to  the  next  4+i  is  described  by  /;(4+i  14,  «4),  the  sensing  relationship  is  described  by  /;(>>/,- 14),  and 
the  initial  pose  at  time  k  =  0  is  described  by  p(Jo).  For  notational  simplicity,  p(z)  and  p(zo)  are 
written  to  indicate  p(Z  =  z)  and  p(Z  =  zo),  respectively. 

Now,  the  Markov  localization  algorithm  is  investigated.  This  algorithm  provides  an  it¬ 
erative  method  for  estimating  the  pose  of  the  robot  given  periodic  sensor  readings.  The  algorithm 
is  developed  by  computing  the  posterior  pose  estimation  (distribution)  given  the  prior  pose  estima¬ 
tion  (distribution),  a  sensor  reading,  and  a  control  input.  The  iterative  computation  is  revealed  by  a 
thoughtfully  chosen  distribution  a: 


a(Jn) 

■-  p(ln,yo,  ■  ■  ■  ,yn\uo, 

(5.8) 

a{l0) 

■=  pGo) 

(5.9) 

This  definition  of  a  allows  an  iterative  computation  of  the  posterior  pose  estimation  at  time  n  +  1 
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given  the  relevant  quantities  at  time  n : 

a(ln+ 1)  =  p(ln+i,yo,---,yn+i\uo,---,un)  (5.10) 

=  P^n+U  yo,  •  •  • ,  yn+l  14,  «(),-••,  Un)p(ln\u0,  un)  (5.11) 

h, 

—  ^  ^  P(yCh  •  •  •  >  33(+l  14+1  >  4,  M  0,  ,  Un)p(ln+\\ln,  Uq,  ...  ,  U  r)p{  l n //(),  .  .  .  ,  )  (5.12) 

h, 

—  /*  ^  ;?CV0,  •  •  ■  >33(14+1,  4,  t<0)  •  •  ■  5  Un)p(yn+ 1  14+1,  4 >  W(),  •  .  .  ,  Un ) 

In 

P(4+ll4,  «0,  .  .  .  ,  M„)p(4l«0,  •  •  •  ,  Un)  (5-13) 

=  ^  PCVO,  •  •  ■  ,33(14,  *<0,  ■  ■  ■  ,  M/j-1  )p(}’n+\  14+1  )p(ln+]  14,  Un)p(ln\U(h  i)  (5.14) 

In 

=  p(ln,yo,  ■  ■  -,yn  |wo, . . . ,  Mn-i)/?(3’«+il4+i)p(4+il4,  un)  (5.15) 

In 

=  pCy„+il4+i)^p(4+il4,Mn)a(4)  (5.16) 

In 

Refeming  to  equation  (5.16),  we  see  that  a(Z„+i)  is  computed  by  first  evolving  a(4)  forward  us¬ 
ing  the  system  dynamics  p(4+il4,  un)  (the  proprioceptive  step),  and  then  by  weighting  the  result 
with  new  sensor  readings  p(yn+\\l,l+\)  (the  preceptive  step).  Once  a(ln+\)  is  computed,  the  pose 
estimation  distribution  is  computed  as 

a{ln+ 1) 


/K4+ 1  |yo>  ■  •  ■  ,  33(+l ,  U0,  ■  ■  ■  ,  Un)  — 


(5.17) 


^j/n+i  tr(/^+l) 

To  reduce  this  distribution  to  a  single  point  estimate,  several  methods  may  be  used  such  as  finding 
the  argmax  or  taking  the  expectation  over  /„+ 1 .  We  prefer  the  later  and  estimate  the  pose  as 


hi+ 1  —  E(p(4+i  lyo,  •  •  • ,  33i+t ,  mo,  ••• ,  Mn))  —  E(  x ,  / 1  ) 


(5.18) 


2j/„+i  0'(4+l) 

Although  conceptually  straightforward,  the  above  algorithm  neglects  to  normalize  a  at 
each  step,  leading  to  large  numerical  errors  over  time.  This  is  solved  by  replacing  a  with  the 
normalized  version  /3: 


PM  := 

p(ln\yo,...,yr,uo,...,um) 

(5.19) 

/?o(/o)  := 

p(k)) 

(5.20) 

Following  steps  similar  to  those  used  above,  we  find  a  2  step  iteration  over  (3: 

PhVn+l) 

—  ^  ^  Pi  ln+ 1 1  hi  ?  ^/2  )/?n_  j  i}n  ) 

In 

P(yn+\ K/n-i)/^(7«+i ) 

H/„+1  f(j«+iI4+iK(4+i) 

(5-21) 

ys;;+1(4+i) 

(5-22) 
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Here  the  proprioceptive  and  preceptive  steps  naturally  reveal  themselves  in  equation  (5.21)  and 
equation  (5.22),  respectively.  To  initialize  this  algorithm,  at  time  n  =  0,  we  start  with  p(lo)  and  a 
sensor  reading  yo-  Since  the  robot  is  assumed  to  be  stationary  prior  to  n  =  0,  u-\  =  0  could  be  used 
in  the  computation,  but  instead,  we  prefer  to  only  compute  the  preceptive  update  for  this  time  step. 
Hence,  we  set  jS[J(Zo)  to  the  prior  p{l o)  to  indicate  that  the  proprioceptive  step  is  skipped.  We  again 
estimate  the  pose  with  an  expectation: 

in+i  =  eos;;:}(/„+i))  (5.23) 

This  section  introduced  our  general  system,  and  applied  Markov  localization  to  it.  The 
next  section  will  investigate  which  regions  of  our  environment  (pose  space)  are  well  suited  to 
Markov  localization. 


5.2  Information 

This  section  introduces  a  metric  for  determining  the  performance  of  Markov  localization 
applied  to  different  regions  of  the  sensor  network.  A  measure  of  a  region’s  ability  to  provide  sensor 
measurements  that  reduce  the  estimation  error  is  presented.  Such  a  measure  is  referred  to  as  infor¬ 
mation  [78]  or  a  localizability  metric  [63].  Information  in  our  context  can  be  understood  by  referring 
back  to  localization.  Recall  that  Markov  localization  provides  more  than  just  the  pose  estimate  /; 
it  provides  the  posterior  distribution  (i  encompassing  all  of  the  knowledge  of  the  current  pose.  In 
particular,  the  [3  distribution  can  be  use  to  determine  the  estimation  error.  Hence,  by  observing  the 
change  in  the  f3  distribution  during  the  preceptive  phase  of  localization,  it  can  be  determined  which 
sensor  readings  are  most  instrumental  in  reducing  estimation  error;  i.e.,  which  regions  have  the  most 
information.  The  development  of  information  is  done  in  3  steps:  entropy  is  defined,  information  is 
defined,  and  system  assumptions  are  imposed  to  simply  the  representation  of  information. 

Entropy  [26]  is  defined  for  a  discrete  distribution  p{z)  as 

(p(z))  =  -2>)  log(p(z))  (5 .24) 

z 

If  p{z)  is  localized  to  one  point,  the  entropy  is  0.  As  p(z)  spreads  out  across  several  points,  its 
entropy  increases  toward  infinity.  Entropy  is  said  to  measure  the  disorder  in  a  distribution;  in  our 
context,  entropy  is  used  to  measure  estimation  error.  Hence,  the  estimation  error  of  a  pose  estimation 
distribution  [3,  is  given  by  its  entropy  'Hi.  [3). 
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Next,  the  change  in  estimation  error  (entropy)  during  the  preceptive  phase  of  localization 
is  computed.  It  is  assumed  that  a  robot  arrives  at  a  pose  Iq  e  computes  the  proprioceptive  pose 
estimate  pi0(l),  receives  a  sensor  reading  y,  and,  computes  the  preceptive  pose  estimate  p/Q(l\y).  The 
notation  pi0(J)  and  pi0(l\y)  is  used  in  place  of  finn(ln+\)  and  respectively,  to  focus  on  a 

single  preceptive  update  occurring  at  the  pose  Iq.  In  other  words,  this  development  is  not  interested 
in  how  the  robot  arrived  at  Iq.  Hence,  it  is  assumed  that  the  proprioceptive  pose  estimate  p/n(l) 
is  known,  thereby  neglecting  all  the  details  leading  up  to  its  computation.  From  this,  information 
I  :  £,  >  [0,  inf)  is  defined  as  the  decrease  in  estimation  error  (entropy)  during  the  preceptive  phase: 

I(lo)  =  'HiPkm  -  'H(Pkdly))  (5-25) 

In  practice,  the  prior  pi0d)  and  the  sensor  reading  y  are  unknown  a  priori.  This  dilemma 
can  be  partially  solved  by  eliminating  y  with  an  expectation: 

m  =  mPiod))  -  EymPkdm  (5.26) 

However,  pi0d)  cannot  be  eliminated  in  such  a  fashion:  it  is  a  complex  quantity,  resulting  from 
potentially  many  steps  of  Markov  localization,  making  it  difficult  to  compute.  Instead,  when  com¬ 
puting  information,  an  approximation  for  p/Q(l)  based  on  observations  of  the  dynamics  is  substi¬ 
tuted.  This  substitution  is  addressed  in  later  sections  along  with  each  particular  implementation.  To 
simplify  our  forthcoming  computations,  2  additional  requirements  are  imposed:  pi0d)  is  solely  a 
function  of  /  -  Iq  and  the  set  of  possible  sensor  values  J/  is  finite.  The  first  requirement  enforces 
that  pi0d)  has  constant  shape;  hence,  it  has  constant  entropy:  let  Ho  =  7T(p/0(/)).  The  second 
requirement  allows  information  to  include  an  (efficient)  summation  over  sensor  values.  With  these 
assumptions,  information  can  be  reduced  further,  arriving  at  the  formulation  determined  by  previous 
work  [78]: 


Ido)  =  Ho  -  EyVH(pk(m\ 

(5.27) 

=  tfo-Z  pi0(y)H(pi0d\y)) 

y 

(5.28) 

=  h0  +  Z  p,0(y )  ^  pi0d\y)iog(pi0d\y)) 

y  l 

(5.29) 

u  ,  v  /  \  v  Pkd,y) ,  rpi0d,y)^ 

=  Ho  +  > ,  pi0(y)  >  .  ,  ,  log(  ) 

V  ^  pio(y)  p/0(y) 

(5.30) 

(5.31) 
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Armed  with  a  formula  for  information,  we  turn  to  detailing  the  steps  needed  to  compute 
it.  Understanding  information  and  entropy  as  measures  is  discussed  more  in  Section  7.1. 

5.3  Information  Map  Computation 

Using  the  analytical  representation  for  information,  this  section  formalizes  the  steps  re¬ 
quired  to  compute  information  across  all  possible  poses.  The  final  representation  (matrix,  grid,  or 
2  dimensional  plot)  is  referred  to  as  an  information  map.  To  compute  the  information  map,  an 
algorithm  is  presented  that  walks  through  the  computation  of  each  term  in  equation  (5.31).  Addi¬ 
tionally,  we  investigate  the  computational  complexity  of  this  algorithm  and  review  the  alternative 
algorithm  (suggested  by  other  researchers  [78]  using  a  traditional  robot  platform)  that  reduces  the 
overall  complexity. 

The  steps  required  to  exactly  compute  the  information  map  are  given  by  Algorithm  1. 
Computing  the  information  given  in  equation  (5.31),  in  turn,  requires  the  computation  of  3  terms: 
//o,  pi0{l,  y),  and  pi0(y).  As  mentioned  in  Section  5.2,  Hq  is  independent  of  /o;  hence,  Hq  is  calculated 
once  at  the  start  of  the  algorithm  (line  1).  The  remaining  2  terms  must  be  calculated  for  each  /o  (line 
2). 

Algorithm  1  Information  Map  Computation 
1 :  compute  Ho 
2:  for  all  /o  e  L  do 
3:  compute  pi0(I) 

4:  for  all  (/,  y)  e  X  x  J/  do 

5:  compute  p(y\l) 

6:  compute  pk(l, y)  =  piQ{l)p(y\l) 

7:  for  all  ye  J/  do 

8:  compute  p,0(y )  =  Zie£  Ph (U  y) 

9:  compute  I(l0)  =  H0  +  X>€.y  Z/sX  P/0(z>  >0^(7^) 


Before  computing  pi0(l,y)  (line  6),  2  more  terms  are  required:  the  prior  pi0(l)  (line  3)  and 
the  sensor  model  p(y\l)  (line  5).  Both  these  terms  are  implementation  (and  model)  specific,  so  the 
computational  details  are  postponed  until  needed.  To  lend  more  traction  to  these  topics,  the  reader 
is  directed  to  Section  5.4  where  pi0(l)  is  modeled  as  a  uniform  distribution  over  a  square  and  /;()]/) 
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is  modeled  as  a  simple  binary  sensor.  In  general,  our  simulations  use  a  simple  prior  distribution 
(uniform  or  normal)  that  is  computationally  easy  to  shift  to  1q  during  each  iteration  of  the  algorithm. 
The  last  term  required  to  compute  information  is  pi0(y)  (line  8)  which  directly  follows  from  piJLy). 
Finally,  information  /(/o)  (line  9)  is  computed  as  directed  by  equation  (5.31). 

A  second  look  at  Algorithm  1  reveals  the  time  complexity.  Working  through  this,  the  time 
required  by  basic  operations  (add,  subtract,  multiply,  log,  etc)  is  replaced  and  reduced  to  constants 
Ck  without  further  explanation.  Since  the  computation  of  pi0(l)  and  p(y\l)  is  model  specific,  the  time 
required  for  these  computations  is  denoted  as  tp  and  ts,  respectively.  Finally,  since  £,  and  J/  are 
finite  sets,  their  cardinalities  are  denoted  as  n/  and  ny,  respectively. 

Starting  at  the  beginning  of  the  algorithm,  note  that  Hq  is  an  entropy  computation,  which 
entails  «/  logs,  «/  multiplications,  and  «/  -  1  subtractions.  Hence,  the  total  time  required  for  Hq 
is  c\ni  -  C2-  The  loop  on  line  2  requires  /?/  iterations,  which  serves  as  a  multiplicative  factor  for 
the  remaining  computation.  Rolling  lines  3  through  6  together  requires  tp  +  niny(ts  +  C3)  where  C3 
represents  the  multiplication  required  for  pi0(l,y).  Similarly,  lines  7  and  8  entails  ny(cpij  -  C4)  since 
for  each  y,  piQ(y)  entails  «/  -  1  additions  at  a  cost  of  C4.  Finally,  line  9  requires  niny  divisions,  logs, 
and  multiplications  and  (ii/  -  1  ){ny  -  1)  +  1  additions,  totaling  c$nyni  -  C(,(nj  +  ny)  +  c2.  Putting  these 
terms  together  (with  the  loop  on  line  2),  the  time  complexity  of  Algorithm  2  is 

(cpi/  -  c2)  +  n,[tp  +  niny(ts  +  c3)  +  ny(cpit  -  c4)  +  ( c5nytii  -  C(,{n,  +  ny)  +  c7)]  (5.32) 

Further  reducing  and  combining  constants,  the  big -O  time  complexity  is 

nj[ny(ts  +  c8)  -  c6]  -  c9n,ny  +  n,(tp  +  ci0)  -  c2  £  0(njnyts  +  nttp)  (5.33) 

where  we  have  intentionally  left  tp  and  ts  in  the  big-O  notation  since  these  terms  are  potentially 
functions  of  11 /  and  ny.  Note,  the  first  and  second  terms  of  the  complexity  are  associated  with  com¬ 
putation  of  the  sensor  model  and  the  prior,  respectively.  If  tp  and  ts  are  constants  (as  they  are  for 
many  of  our  simulations),  the  time  complexity  reduces  to  0(njny).  Hence,  a  practical  implementa¬ 
tion  of  Algorithm  1  must  carefully  implement  the  most  deeply  nested  computations  (lines  5  and  6) 
in  order  for  the  algorithm  to  remain  feasible. 

To  reduce  the  time  spent  in  lines  5  and  6,  Algorithm  2,  an  approximate  information  map 
algorithm  (developed  by  other  researchers  [78]  using  a  traditional  mobile  robot  platform),  is  re¬ 
viewed.  This  algorithm  exploits  tacit  assumptions  about  locality  within  the  system.  In  particular, 
since  the  prior  pi0(l)  gains  most  of  its  support  within  a  small  neighborhood  of  Iq,  information  compu¬ 
tation  is  restricted  to  this  neighborhood.  More  specifically,  when  p/0(l)  is  generated,  an  approximate 
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Algorithm  2  Approximate  Information  Map  Computation 
1:  compute  Ho 

2:  for  all  /o  e  L  do 

3:  compute  p/0(l)  and  the  approximate  support  set  L/0 

4:  compute  the  approximate  set  of  sensor  readings  Y/0 

5:  for  all  (/,  y)  £  LxY  do 

6:  compute  p(y\l) 

7:  compute  ph(l, y)  =  pi0(l)p(y\l) 

8:  for  all  v  e  Y  do 

9:  compute  p,0(y)  =  £/eL  Pl0(l  y ) 

10:  compute  I(l0)  =  Ho  +  2V€K  ZieL Pk(l’y)lo8(^y) 


support  set  L/0  c  £,  for  p/0(Z)  containing  the  most  relevant  points  is  also  generated.  Then,  a  reduced 
set  of  possible  sensor  values  T/0  based  on  L/0  is  generated.  The  reduced  sensor  space  is  the  primary 
contributor  to  reduced  computation  time.  These  optimizations  are  particularly  well  suited  for  the 
sensor  network  model  where  a  robot  only  excites  nearby  sensor  nodes.  In  effect,  this  algorithm 
switches  from  computing  information  at  lo  using  the  entire  space  XxJ/  to  using  only  a  localized 
space  L/0  x  F/0. 

Now,  it  is  shown  that  reducing  the  computation  space  dramatically  reduces  the  time  com¬ 
plexity.  The  time  required  to  compute  the  prior  and  its  support  set  is  denoted  by  tp.  Additionally,  the 
time  required  to  compute  the  reduced  sensor  measurement  space  L/0  is  denoted  by  ty.  Finally,  the 
cardinality  of  the  reduced  spaces  L/0  and  Y/0  is  denoted  by  iy  and  ny,  respectively.  Walking  through 
Algorithm  2  in  a  fashion  similar  to  above  computations,  the  time  complexity  is  found  to  be 

(c\ni  -  C2 )  +  nt[tp  +  ty  +  hjhy{ts  +  C3)  +  hy{c^hi  -  C4 )  +  (c^hyhi  -  C(,(hi  +  hy)  +  C7)]  (5.34) 
=  nihi[hy(ts  +  c^)  -  c6]  -  c9nihy  +  ni(tp  +  ty  +  cio)  -  c2  (5.35) 

e  0(nihihyts  +  ni(tp  +  ty))  (5.36) 

Comparing  equation  (5.36)  and  equation  (5.33),  it  is  clear  that  the  approximate  algorithm  has  re¬ 
duced  the  cost  associated  with  the  sensor  model  and  potentially  increased  the  cost  associated  with 
the  prior  and  sensor  measurement  space.  Again,  if  ts,  tp,  and  t\  are  constants,  the  time  complexity 
reduces  to  «/7i/nv  resting  on  the  shoulders  of  the  sensor  model.  Since  the  cardinality  of  the  reduced 
spaces  L/0  and  T;0  is  much  smaller  than  the  whole  space,  the  time  complexity  has  been  significantly 
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reduced.  Based  on  this  evidence,  our  work  uses  exclusively  Algorithm  2  for  simulations.  Furnished 
with  a  feasible  method  of  computing  information  maps,  the  next  section  presses  forward  with  sev¬ 
eral  simulations. 


5.4  Simulations 

Flaving  previously  traced  through  all  the  steps  necessary  to  compute  an  information  map, 
this  section  computes  several  such  maps  for  a  sensor  network  model.  First,  a  basic  robot  prior  and 
sensor  network  model  are  presented.  Then,  an  information  map  is  computed  for  this  system  and  is 
discussed  along  with  the  required  CPU  time.  Finally,  to  illuminate  the  effect  a  prior  model  has  on 
the  information  map,  several  maps  with  varying  priors  are  computed  and  discussed. 


Field 

size 

100m  x  100m 

grid 

200  x  200 

sensor  distribution 

uniform 

Ns 

30 

Sensor 

model 

discrete, binary 

rs 

10  cells 

Ps 

0.75 

Estimator 

prior  model 

discrete,  box 

h 

varies 

Table  5.1:  System  models  and  parameters  used  for  computing  several  information  maps. 


A  summary  of  the  system  details  is  provided  in  Table  5.1.  The  simulated  sensor  network 
has  Ns  =  30  sensors  uniformly  distributed  on  the  playing  field  that  spans  100m  by  100m  and  is 
divided  by  a  200  by  200  regular  grid.  The  global  sensor  model  p(y\l)  is  an  independent  combination 
of  all  the  node  sensor  models: 

Ns 

p(y\l)  =  Y\p(y>n\l)  (5.37) 

m—  1 

Each  sensor  is  represented  by  the  discrete  binary  sensor  model.  For  the  mth  node,  during  each  sens¬ 
ing  period,  this  model  assumes  that  the  sensor  either  reports  a  detection  (ym  =  ©)  with  probability 
ps  or  remains  silent  (ym  =  0)  with  probability  1  -  ps.  More  precisely,  the  detection  distribution  for 
the  mth  sensor,  with  sensing  radius  rv,  located  at  zm  e  £.  is  given  by 


p(ym  -  ©10  =  < 


Ps  if  11/  -  zm\\  <  rs 


0  otherwise 


(5.38) 
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where  ||  ■  ||  is  the  infinity  norm  on  the  partition  space  £  and  is  the  larger  of  either  cells  horizontally  or 
cells  vertically.  In  this  norm,  a  radius  of  r  covers  a  2r  +  1  by  2r  +  1  square  of  cells;  hence,  d  =  2r  +  1 
is  referred  to  as  the  diameter  of  such  a  set  of  cells. 

A  binary  sensor  model  may  seem  trivial,  but  it  accurately  captures  the  details  of  certain 
mote  sensors  used  in  real  deployments.  Indeed,  during  PEG,  the  magnetometer  sensor  (with  an 
r 3  response  drop-off  and  threshold  detection)  amounted  to  a  binary  detector.  Other  researchers  [4] 
have  also  identified  the  usefulness  of  a  binary  sensor  model  in  sensor  network  environments.  For 
this  simulation,  the  sensor  has  a  detection  radius  of  rs  =  10  cells  and  a  detection  probability  of 
Ps  =  0.75. 

The  robot  prior  pi0(l)  is  taken  to  be  uniform  on  a  square  of  grid  cells.  In  particular,  pin(J) 
is  represented  by  the  discrete  box  prior  model  with  a  radius  of  rp  cells: 

Pl0(D  ~  G({/‘  -rp,l10-fp  +  l,...,ll0  +  rp}x{l20-fp,l20-fp  +  l,...,l20  +  fp ))  (5.39) 

Hence,  p/0(Z)  is  uniform  across  a  dp  by  dp  square  of  cells  centered  at  Zq  where  dp  =  2 rp  +  1. 

Before  the  simulation  can  commence,  the  computation  of  L/0  and  F/0  must  be  outlined. 
Computing  L/0  is  straightforward:  the  exact  support  for  pi0(J)  (i.e.,  the  dp  by  dp  square  centered  at 
Zo)  is  used.  To  compute  T/0  cjZ  =  {y  =  (y\,y>2,  ■  ■  •  ,yNs)\ym  £  {0,®}},  observe  that  sensors  close  to 
Zo  can  either  report  (ym  =  ©)  or  not  report  (ym  =  0),  whereas,  sensors  far  from  Zo  will  not  report 
(ym  =  0).  Hence  we  let  T/0  be  such  a  set: 


Y,0  =  {y  =  (yuyi, . . .  ,yNs)\ym  e  Y£} 


(5.40) 


where 


{0,  ©}  if  ||/o  —  Zmll  <  rs  +  rp 
{0}  otherwise 


(5.41) 


Using  the  above  details,  the  simulation  proceeds  for  varying  prior  radii.  The  informa¬ 
tion  map  computed  with  rp  =  4  cells  is  displayed  in  Figure  5.2  as  both  a  2-dimensional  and  3- 
dimensional  plot.  In  the  2-dimensional  plot,  nodes  and  their  sensing  regions  are  identified  by  white 
dots  and  dark  blue  dashed  lines,  respectively.  The  color  background  on  the  plots  represent  the 
amount  of  information.  These  2  plots  with  a  vibrant  color  scheme  are  included  to  highlight  the 
variation  in  information  across  our  playing  field.  In  particular,  information  poor  regions  occur  in 
2  places:  far  from  sensor  coverage  and  close  to  node  locations.  For  regions  far  from  sensing  cov¬ 
erage  it  is  intuitive  that  the  robot  gleans  no  estimation  help.  In  these  regions,  nodes  do  not  report 
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(a)  3-dimensional  view  (b)  2-dimensional  view 


Figure  5.2:  An  information  map  computed  using  discrete,  binary  sensors  and  an  uniform  prior  with 

radius  rp  =  4  cells  as  detailed  in  Table  5.1. 

detections  which  is  expected  by  Markov  localization;  hence,  no  information  is  provided  to  the  es¬ 
timator.  Flowever,  it  may  seem  surprising  that  no  information  is  gained  by  a  robot  very  close  to  a 
node.  This  occurs  when  a  robot  next  to  a  node  has  its  prior  entirely  enclosed  in  the  node’s  sensing 
radius.  Whether  the  node  detects  the  robot  or  not  is  of  no  consequence  to  the  robot  since  it  already 
knows  its  pose  with  a  greater  accuracy  than  the  node  can  provide.  Turning  to  the  information  rich 
regions,  we  might  have  guessed  that  these  regions  were  at  the  center  of  nodes,  but  that  fallacy  has 
been  dispelled.  Instead,  the  highest  information  regions  are  those  close  to  a  node’s  edge  of  detec¬ 
tion,  or,  better  yet,  in  regions  close  to  several  sensing  edges.  In  these  regions,  the  robot’s  prior  tends 
to  be  partially  in  and  partially  out  of  a  sensing  region.  If  the  mote  reports,  the  estimation  routine 
only  retains  the  outside  part  of  the  prior;  if  the  mote  does  not  report,  only  the  inside  part  is  retained. 
Hence,  these  regions  provide  a  good  way  for  the  robot  to  quickly  shave  off  part  of  its  prior  and  re¬ 
duce  its  estimation  error.  In  a  nutshell,  the  robot  is  better  off  near  detection  edges  where  the  chances 
of  being  detected  and  going  undetected  are  roughly  equal.  Hence,  a  robot  that  navigates  between 
several  sensing  regions  without  being  detected  often  attains  a  lot  of  information.  Later  sections  will 
show  that  robots  navigating  intelligently  around  nodes  often  do  better  than  ones  navigating  close  to 
nodes. 

Information  maps  computed  for  6  different  prior  radii  (  rp  e  { 1, 2, 4, 7, 12, 17})  are  shown 
in  Figure  5.3.  These  plots,  and  all  the  remaining  information  plots  in  this  paper,  use  a  2-dimensional 
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Figure  5.3:  Information  maps  computed  using  discrete,  binary  sensors  and  an  uniform  prior  with 
varying  radius  rp  as  detailed  in  Table  5.1. 
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representation.  Additionally,  to  increase  the  legibility  of  future  plots  that  include  more  symbols,  the 
plots  switch  to  a  muted  color  scheme  of  blue  tones.  These  plots  show  that  as  the  prior’s  radius 
increases,  fewer  regions  in  the  map  are  void  of  information.  This  occurs  because  a  large  prior 
has  a  good  chance  of  intersecting  one  or  more  sensing  regions,  and,  regardless  of  what  the  sensor 
network  reports,  the  large  prior  can  be  trimmed  down.  In  particular,  notice  that  with  rp  e  {1,2, 4, 7), 
the  center  of  a  node’s  sensing  region  has  a  information  null  since  the  robot’s  prior  is  small  enough  to 
fit  entirely  in  the  sensing  region.  However,  once  the  prior’s  radius  rp  exceeds  the  sensing  radius  rs 
in  the  last  2  plots,  the  entire  node’s  sensing  region  becomes  useful.  However,  the  sensing  center  still 
remains  a  relatively  low  information  region,  contradicting  the  intuitive  notion  that  a  robot  benefits 
the  most  by  driving  close  to  nodes.  Additionally,  these  plots  underscore  the  importance  of  choosing 
an  accurate  prior  for  ensuring  that  information  maps  are  representative  of  localizability. 

The  approximate  information  map  algorithm  is  carried  out  in  MATLAB  R14  on  a  2.6GHz 
P4  with  1GB  RAM.  The  computation  time  required  for  each  simulation  is  summarized  in  Table  5.2. 
The  mean  time  required  to  compute  each  map  steadily  increases  from  17.01  minutes  for  a  small 
prior  to  116.77  minutes  for  a  large  prior.  Furthermore,  the  mean  time  required  to  update  each  cell 
varies  from  25.5  ms  to  175.2  ms.  This  indicates  that  a  robot  can  update  the  information  map  in  real 
time  to  compensate  for  a  changing  environment.  For  instance,  if  a  node  goes  online  or  offline,  the 
information  map  should  be  updated  in  the  2 (rp  +  rs)  +  1  diameter  square  centered  at  this  node.  For 
our  simulations  with  rs  =  10  cells  and  rp  =  4  cells,  the  map  could  compensate  for  such  a  node 
(updating  a  29  by  29  region  of  the  map)  in  about  26.6  seconds.  Hence,  occasional  changes  in  the 
environment  could  be  accounted  for  in  real  time. 


rp 

cell  computation 

grid  computation 

mean  (ms) 

total  (s) 

total  (min) 

1 

25.5 

1020.72 

17.01 

2 

28.9 

1154.50 

19.24 

4 

31.6 

1264.28 

21.08 

7 

51.1 

2044.17 

34.07 

12 

98.6 

3942.63 

65.71 

17 

175.2 

7006.23 

116.77 

Table  5.2:  Time  required  by  a  2.6GHz  Pentium4  with  1GB  RAM  running  MATLAB  R14  to  compute 
an  information  map  using  the  parameters  from  Table  5.1. 


This  chapter  presented  an  overview  of  information  maps  for  sensor  networks.  Markov 
localization  and  information  were  reviewed  analytically  and  algorithmically.  Special  attention  was 
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paid  to  the  time  complexity  of  computing  information  for  an  entire  region.  A  system  model  for  a 
robot  localizing  in  a  sensor  network  was  developed  and  several  simulations  were  performed  with 
this  model.  The  simulation  results  provide  evidence  that  robots  localizing  in  sensor  networks  will 
benefit  by  navigating  close  to  sensor’s  edge  of  detection.  The  next  chapter  investigates  this  idea 
more  formally,  and  compares  the  localization  accuracy  of  robots  that  obey  and  disregard  this  idea. 


63 


Chapter  6 

Localization  using  Information  Maps 


Now  that  the  generation  of  information  maps  has  been  throughly  studied,  the  remaining 
work  will  focus  on  applying  information  to  aid  in  localization.  Our  work  methodically  builds  up 
to  simulations  of  realistic,  closed-loop  navigation  systems  starting  with  simple,  open-loop,  zero 
noise  systems.  This  chapter  focuses  strictly  on  open-loop  systems  to  validate  the  applicability  of 
information  maps.  First,  the  steps  required  to  compute  localization  updates  needed  by  the  simulator 
are  presented.  Next,  an  initial  localization  simulation  is  perused  and  used  to  demonstrate  the  ability 
of  an  information  based  path  to  improve  localization  performance  over  other  path  techniques.  After 
this,  the  basic  system  models  are  relaxed  to  more  realistic  models  and  are  adapted  for  use  with 
information  maps  and  Markov  localization.  Finally,  the  new  models  are  used  to  evaluate  the  ability 
of  information  based  paths  to  improve  localization  accuracy  and  confidence. 

6.1  Markov  Localization  Revisited 

Before  beginning  any  localization  simulations,  this  section  takes  a  methodological  look  at 
Markov  localization.  Using  the  analytical  introduction  to  Markov  localization  from  Section  5.1,  this 
section  outlines  the  necessary  computational  steps.  Additionally,  the  time  complexity  is  analyzed 
and  an  approximate  alternative  to  exact  Markov  localization  is  reviewed. 

The  presentation  of  the  Markov  localization  computation  first  walks  through  the  iterative 
update  for  time  n  +  1 ,  and  then  returns  to  discuss  the  initialization  of  the  algorithm  at  time  0.  The 
update  algorithm  for  time  n  +  1  is  shown  in  Algorithm  3.  The  algorithm  is  initialized  (line  1)  with  a 
new  sensor  reading  yn+i,  the  last  control  value  un,  and  the  previous  iteration’s  posterior  distribution 
/("_](/„).  The  remainder  of  the  algorithm  computes,  in  order,  the  proprioceptive  update  given  by 
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equation  (5.21),  the  preceptive  update  given  by  equation  (5.22),  and  the  expected  pose  given  by 
equation  (5.23).  For  each  possible  new  pose  Z„+i,  the  proprioceptive  update  f3"(ln+i)  is  achieved  on 
line  6.  Prior  to  this,  the  transition  probability  p(ln+\\ln,  un)  is  computed  (line  4)  and  applied  (line 
5)  for  each  possible  previous  pose  Z„.  In  general,  the  computation  of  p(ln+i\ln,  un)  must  be  handled 
with  care  to  ensure  this  part  of  the  update  algorithm  is  feasible  for  a  large  pose  space  £.  Since  this 
computation  is  dependent  on  the  dynamics,  the  computational  details  arc  addressed  later  for  specific 
models.  Next,  the  algorithm  prepares  to  compute  the  preceptive  update  (achieved  on  line  1 1).  First, 
the  sensor  model  p(yn+i\ln+i)  is  computed  (line  7)  using  the  new  sensor  reading,  and  the  result 
is  merged  with  the  proprioceptive  distribution  (line  8).  Again,  the  computation  of  p(yn+i\ln+i)  is 
dependent  on  the  system  model;  hence,  the  details  are  delayed  until  necessary.  Next,  the  algorithm 
exits  the  loop  over  ln+\,  and  computes  the  normalization  constant  Ux  (line  9).  Finally,  for  each 
new  pose  ln+\,  the  preceptive  update  is  computed  (line  1 1).  The  last  step  (line  12)  is  to  compute  the 
estimate  of  the  pose  using  an  expectation  of  the  posterior  /3"+1(Z„+i). 

To  initialize  the  algorithm  at  time  0,  only  a  preceptive  update,  as  mentioned  in  Section  5.1, 
is  computed.  In  particular,  given  a  prior  p(Zo)  and  a  new  sensor  reading  yo,  the  proprioceptive  update 
computation  (lines  3-6)  is  skipped  and  /?{[(Zo)  is  set  to  the  prior  p{l o).  After  which,  normal  operation 
is  resumed  to  compute  /^(Zo)  and  1q.  The  computational  details  of  the  prior  p(l o)  are  delayed  until 
necessary. 


Algorithm  3  Markov  localization  update 
1:  given /3”_j (/„),  yn+ 1,  and  un 
2:  for  all  /„+ 1  e  £  do 
3:  for  all  ln  €  £  do 

4:  compute  p(l„+i\l„,  un) 

5:  compute  y(/„+i,  /„)  =  p(ln+i\ln,  unWn_x{ln) 

6:  compute  j8£(Z„+i)  =  Z/„  jUn+lJn) 

7:  compute  p(y„+i\ln+i) 

8:  compute  <p(ln+l)  =  p(yn+l\ln+l)K(ln+x) 

9:  compute  £n+i  =  2z„+1  ip{ln+ 1) 

10:  for  all  ln+ 1  e  £  do 

11:  compute /?;;+ '(/„+!)  =  <p(ln+ !)/£,+! 

12:  tn+i  =  eos;;+1(z„+1)) 
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Walking  through  Algorithm  3,  the  time  complexity  is  computed  using  the  same  procedure 
as  that  used  in  Section  5.3.  We  find  the  big -O  time  complexity  as 

ni[ni(ti  +  ci)  +  (ni  -  1  )c2  +  ts  +  c3]  +  {nt  -  1  )c4  +  c5n/  +  (nt  -  l)c6  (6.1) 

=  nf(ti  +  ci)  +  ni(ts  +  cs)  +  C9  (6.2) 

e  Oinjti  +  nits)  (6.3) 

where  and  ts  are  the  times  required  to  compute  the  transition  distribution  p{ln+\\ln,un)  and  the 
sensor  model  p{yn+i\ln+\)>  respectively.  Since  the  computation  of  both  the  transition  and  sensor 
model  can  greatly  impact  the  running  time  of  Markov  localization,  these  computations  must  be 
efficient.  In  particular,  the  time  required  for  the  transition  model  must  be  limited  since  it  is  magnified 
by  nj.  The  time  required  to  compute  the  transition  and  sensor  distributions  is  discussed  in  later 
sections  along  with  the  accompanying  system  models. 

If  1 1  and  ts  arc  constants,  the  big -O  complexity  reduces  to  nj.  This  term  represents  the  size 
of  the  space  to  be  updated.  In  order  to  reduce  this  term,  our  simulations  apply  the  selective  update 
approach  from  the  literature  [10].  This  approach  approximates  the  preceptive  update  step  by  only 
using  the  previously  detailed  update  algorithm  for  cells  with  /?"(/„+ 1)  larger  than  a  threshold.  The 
remaining  cells  are  updated  with  the  simple  rule 

/?;;+1(z,i+1)  =  Afir„(in+ 1)  (6.4) 

where  A  is  chosen  to  normalize  the  resulting  /3"+1(/«+i)  distribution.  For  our  simulations,  this  ap¬ 
proach  can  dramatically  reduce  the  time  required  for  sensory  updates. 

6.2  Simulations 

This  section  and  the  remainder  of  this  chapter  are  dedicated  to  using  information  maps  to 
improving  robot  localization  in  sensor  networks.  After  discussing  information  maps  (Sections  5.2, 
5.3,  and  5.4)  and  the  details  of  robot  localization  (Sections  5.1  and  6.1),  this  section  combines  the 
two  concepts  in  simulation.  First,  a  full  system  model,  now  including  robot  dynamics,  is  presented. 
Then,  an  initial  set  of  localization  simulations  are  performed.  Finally,  the  results  are  discussed. 

The  localization  simulations  entail  a  robot  traversing  a  sensor  network,  receiving  mea¬ 
surements,  and  estimating  its  location.  These  simulations  are  specified  by  several  details: 


•  the  field  and  sensor  layout 
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Field 

size 

100m  x  100m 

grid 

200  x  200 

sensor  distribution 

uniform 

Ns 

30 

Sensor 

model 

discrete,  binary 

rs 

10  cells 

Ps 

0.75 

Estimated  Prior 

prior  model 

discrete,  box 

h 

4  cells 

Estimator 

prior  model 

discrete,  box 

rP 

50  cells 

Dynamics 

model 

discrete,  adjacent  cell 

U 

4  cells 

Pose 

lo 

5  10 

r 

Igoal 

70  90 

T 

Simulation 

trials 

100 

path  type 

xy,  yx,  straight,  node  centers,  manual 

Table  6.1:  System  models  and  parameters  used  during  our  initial  localization  simulations. 

•  the  sensor  model 

•  the  information  map  parameters  (estimated  prior) 

•  the  initial  conditions  of  the  estimator 

•  the  robot  path 

•  the  robot  dynamics 

For  our  initial  simulations,  these  details  are  summarized  in  Table  6.1,  but  require  more  explanation. 
Much  of  the  setup  is  inherited  from  the  previous  information  map  simulations  in  Section  5.4.  More 
specifically  a  100m  by  100m  field  with  30  sensors  uniformly  distributed  is  used.  Each  sensor  uses 
the  discrete  binary  sensor  model  with  a  detection  radius  rs  —  10  cells  and  a  detection  probability 
ps  of  0.75.  The  information  map,  computed  before  the  localization  simulation,  uses  the  discrete 
box  prior  model  with  radius  rp  =  4  cells.  The  estimator  is  initialized  with  the  robot  prior  p(lo) 
defined  as  a  discrete  box  prior  with  radius  rp  =  50  cells  centered  about  the  robot’s  initial  position. 
Notice,  the  distinction  between  rp  and  rp:  rp  is  the  initial  radius  of  the  prior  during  simulation,  rp 
is  the  prior  radius  assumed  during  computation  of  the  information  map.  The  route  the  robot  takes 
is  varied  across  simulations  to  compare  localization  error  associated  with  different  paths.  Flowever, 
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the  robot  is  required  to  start  at  pose  Iq  (in  the  lower  left)  and  end  at  pose  lgoai  (in  the  upper  right). 
The  robot  model  is  defined  using  the  discrete  adjacent  cell  dynamics.  The  discrete  adjacent  cell 
dynamics  specify  that  a  robot’s  pose  l  e  £  evolves  according  to 

4+t  =  4  +  Uk  (6-5) 

where 

uk  6  flA  =  {-U,  -U  +  1, . . . ,  0, . . . ,  U  -  1,  U}2  c  Z2  (6.6) 

In  other  words,  the  robot  may  move  up  to  U  cells  (4  cells  for  this  simulation)  both  horizontally  and 

vertically. 

To  test  the  ability  of  the  information  map  to  predict  localizability,  several  different  robot 
path  schemes  are  tested: 

•  xy:  drive  in  the  x-direction,  then  in  the  y-direction 

•  yx:  drive  in  the  y-direction,  then  in  the  x-direction 

•  straight:  drive  in  a  straight  line 

•  node  centers:  drive  through  several  sensing  centers  on  the  way  to  the  destination 

•  manual:  using  the  information  map,  a  human  chooses  a  path  that  takes  the  robot  through  high 
information  regions 

The  first  3  path  schemes  represent  basic  paths  that  the  robot  might  use  to  get  to  the  destination.  These 
schemes  assume  no  information  is  known  about  the  environment.  The  next  scheme,  node  centers, 
is  generated  by  a  path  planner  that  tries  to  drive  through  the  center  of  several  sensing  regions  on  its 
way  to  the  destination.  This  path  planner  is  described  in  more  detail  in  Section  7.2.  For  now,  it  is 
assumed  that  this  path  is  provided.  The  final  path  scheme,  manual,  represents  a  path  chosen  by  us 
after  looking  at  the  information  map.  This  scheme  is  an  attempt  by  a  human  to  maximize  the  overall 
information  the  robot  encounters  on  its  way  to  the  destination. 

Since  this  is  the  first  set  of  localization  simulations,  the  mechanics  and  results  of  the  simu¬ 
lation  are  perused.  Prior  to  running  the  localization  simulation,  the  information  map  is  computed  as 
described  in  the  previous  chapter.  The  result  is  shown  in  Figure  6.1.  In  this  case,  3  high  information 
regions  are  obvious  in  the  field:  in  the  upper  left,  in  the  lower  right,  and  part  way  up  the  (0, 0)- 
(100, 100)  diagonal.  If  the  information  maps  are  a  good  measure  of  localizability,  a  robot  passing 
through  these  regions  should  have  less  estimation  error  than  a  robot  that  avoids  these  regions.  To  test 
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Figure  6.1:  Information  map  for  our  initial  localization  simulations.  White  dots  outlined  in  black 
represent  motes  and  dotted  squares  represent  their  detection  region.  Background  color  of  the  plot 
represents  the  amount  of  information. 
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Path  Type 

Length  (m) 

Travel  Time  (s) 

nd 

ntj  1 00  m 

xy 

145.00 

7.30 

19.72 

13.60 

yx 

145.00 

7.30 

29.44 

20.30 

straight 

103.08 

7.10 

4.56 

4.42 

node  centers 

252.66 

20.40 

133.92 

53.00 

manual 

289.31 

15.40 

43.20 

14.93 

Table  6.2:  Path  information  for  each  path  scheme  across  all  trials  during  our  initial  localization 
simulations. 


Path  Type 

Estimation  Error  (m) 

Estimation  Entropy 

Min 

Mean 

Max 

Min 

Mean 

Max 

xy 

2.15 

2.39 

3.37 

2.13 

2.56 

3.84 

yx 

2.55 

2.69 

3.51 

3.00 

3.33 

4.41 

straight 

9.27 

9.29 

9.40 

6.98 

6.99 

7.01 

node  centers 

0.80 

0.95 

1.38 

1.11 

1.23 

1.51 

manual 

0.65 

0.73 

1.03 

0.57 

0.63 

0.85 

Table  6.3:  Estimation  error  and  entropy  for  each  path  scheme  across  all  trials  during  our  initial 
localization  simulations. 


this,  a  robot  is  simulated  following  each  of  the  aforementioned  paths  100  times.  Each  trial  records 
various  statistics  about  the  estimation  performance.  Each  path  scheme,  estimated  path  information, 
and  information  map  are  displayed  in  Figure  6.2.  These  plots  depict  an  overhead  view  of  each  path 
(solid  black  line)  superimposed  on  the  information  map  (background  color).  The  white  circles  and 
squares  represent  the  starting  and  ending  positions  of  the  robot,  respectively.  The  light  green  line 
shows  the  estimated  path  for  a  typical  trial.  A  typical  trial  is  a  trial  such  that  the  difference  between 
its  mean  estimation  error  and  the  mean  estimation  error  across  ah  trials  is  minimal.  For  ah  paths, 
but  the  straight  path,  the  typical  estimated  path  eventually  converges  to  the  actual  path  before  the 
destination  is  achieved.  The  straight  path  misses  the  detection  region  of  all  but  a  few  motes  making 
localization  difficult.  Shortly,  we  will  investigate  more  precisely  how  this  spoils  the  localization 
performance.  Before  leaving  this  figure,  note  that  the  node  centers  and  manual  paths  tend  to  take 
a  similar  route  that  is  also  longer  than  other  paths.  These  2  paths  provide  an  interesting  point  of 
comparison  allowing  us  to  quantify  how  much  the  localization  accuracy  can  benefit  with  smarter 
paths. 

Across  ah  trials,  for  each  path,  several  statistics  are  collected:  the  path  length,  the  path 
travel  time,  the  mean  number  of  detections  hd,  the  mean  number  of  detections  per  100  meters  hd! 100 
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Figure  6.2:  Basic  simulation  overview  plots  for  our  initial  localization  simulations  depicting  the 
robot  starting  (white  circle)  and  ending  (white  square)  positions,  the  robot  path  (solid  black  line), 
and  a  typical  estimated  robot  path  (solid  light  green  line)  superimposed  on  the  information  map 
(background  color).  Each  figure  is  captioned  with  the  path  scheme  depicted. 
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m,  and  the  bounds  and  the  mean  of  both  estimation  error  and  estimation  entropy.  These  statistics 
are  compiled  in  a  path  information  table  (Table  6.2)  and  an  estimation  information  table  (Table  6.3). 
Referring  to  Table  6.2,  notice  the  large  range  in  number  of  detections  per  100  meters.  The  straight 
path  only  receives  an  average  of  4.42  detections  per  100  meters,  which  results  in  high  estimation 
error  and  estimation  entropy  (see  Table  6.3).  On  the  other  end  of  the  spectrum,  is  the  node  centers 
path  which  generates  an  average  of  53.00  detections  per  100  meters  by  driving  through  the  center 
of  detection  regions.  In  fact,  this  scheme  does  reduce  the  estimation  error.  One  look  at  the  manual 
path  scheme,  however,  indicates  that  more  detections  does  not  always  implies  less  error.  The  manual 
path,  in  comparison  to  node  centers,  is  able  to  achieve  lower  estimation  error  with  fewer  detections. 
Let’s  investigate  the  mechanisms  underlying  this  relationship  by  turning  to  the  basic  paths. 

Note  that  yx  undeiperforms  xy,  even  though  yx  receives  49.3%  more  detections.  There  are 
2  mechanisms  contributing  to  this  phenomena:  the  time  lapse  until  first  detection  and  the  location 
of  detections  with  respect  to  the  current  (3  distribution.  The  xy  path  takes  the  robot  immediately 
through  high  information  regions  formed  at  the  edge  of  several  detection  regions  (Figure  6.2a).  The 
yx  path  scheme  (Figure  6.2b),  however,  requires  more  time  before  the  robot  encounters  a  detection 
region,  but  has  the  robot  drive  through  the  center  of  a  detection  region.  For  all  schemes,  the  prior 
starts  out  as  a  uniform  distribution  over  a  large  square  region.  As  the  robot  progresses  through 
the  field,  this  region  is  reduced  by  detections  (or  lack  of  detections)  eliminating  possible  poses  and 
improving  estimation  accuracy.  Since  the  prior’s  support  region  is  initially  large,  often  the  best  way 
to  quickly  reduce  it  is  to  get  a  detection.  This  first  detection  reduces  the  [3  distribution  to  (less  than) 
the  sensing  region  of  the  mote.  If  the  robot  is  close  to  the  center  of  this  region,  the  estimated  position 
(the  mean  of  the  [3  distribution)  will  have  low  error;  if  the  robot,  however,  is  closer  to  the  edge  of 
this  region,  the  estimated  pose  will  have  higher  error. 

To  further  observe  this  phenomena,  the  average  estimation  error  versus  time  is  displayed 
in  Figure  6.3.  Figure  6.3a,  shows  xy’s  estimation  error  sharply  falling  off  just  before  yx’s  does. 
This  is  caused  by  xy’s  path  encountering  a  sensing  region  before  yx’s  does.  However,  Figure  6.3a 
shows  that  when  yx’s  estimation  error  falls  off  a  little  later  (due  to  it’s  encounter  with  a  detection 
region),  it  drops  to  a  lower  level.  This  is  caused  by  yx  leading  the  robot  through  the  middle  of  the 
sensing  region  and  xy  leading  the  robot  through  the  edge  of  a  sensing  region.  Hence,  the  mean  of 
the  posterior  distribution  f3  better  approximates  the  actual  robot  pose  for  yx  than  xy  after  the  initial 
detections.  However,  even  with  yx  achieving  a  better  posterior  pose  estimation  by  t  —  2  seconds,  and 
receiving  more  detections  overall,  it  still  under  performs  the  xy  path  scheme.  This  is  due  to  xy  more 
consistently  reducing  the  entropy  by  encountering  high  information  regions  (or  detection  region 


mean  estimation  error  (cells)  mean  estimation  error  (cells)  mean  estimation  error  (cells) 
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(c)  all  path  schemes 

Figure  6.3:  Mean  estimation  error  versus  time  during  the  initial  localization  simulations.  Captions 

denote  the  path  schemes  displayed. 
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edges).  To  observe  this  effect,  the  mean  estimation  entropy  versus  time  is  shown  in  Figure  6.4. 
Notice,  xy,  as  expected,  has  lower  entropy  more  often  than  yx.  Furthermore,  the  moment  at  which 
yx  has  lower  entropy  is  during  the  time  yx  lies  in  the  middle  of  a  sensing  region  and  receives  several 
detections,  amounting  to  a  temporary  win  for  yx  at  the  cost  of  network  bandwidth. 

Now,  we  turn  to  comparing  more  precisely  the  differences  in  the  2  intelligent  path  schemes: 
node  centers  and  manual.  Turning  back  to  Table  6.3,  notice  that  the  path  explicitly  using  the  infor¬ 
mation  map  (manual  path)  has  achieved  the  best  estimation  performance.  In  particular,  the  manual 
scheme  has  23.16%  less  mean  estimation  error  and  48.78%  less  mean  entropy  than  the  respective 
next  best  schemes.  Additionally,  it  is  encouraging  that  the  manual  path  only  requires  a  modest 
number  of  detections.  The  next  best  performing  scheme,  node  centers,  requires  210.00%  more  de¬ 
tections  than  the  manual  scheme.  The  success  of  the  manual  path  scheme  is  reiterated  by  Figure  6.3 
where  except  for  a  brief  moment,  the  manual  path  always  maintains  the  lowest  estimation  error. 
Finally,  observe  that,  as  expected,  the  manual  path  scheme  maintains  lower  estimation  entropy  at  all 
times  than  any  other  scheme  as  shown  in  Figure  6.4. 

These  simulations  provide  insight  into  many  of  the  mechanisms  at  play  for  good  localiza¬ 
tion.  ffowever,  since  the  system  dynamics  have  no  noise,  the  results  are  focused  on  how  efficiently 
the  system  can  reduce  the  large  staring  prior  (a  101  cell  by  101  cell  square)  to  an  accurate  position 
estimate.  This  convergence  information  is  summarized  in  Table  6.4.  Notice  that  only  the  manual 
scheme  converges  to  within  a  1  cell  accuracy  of  the  actual  position.  Furthermore,  the  only  other 
path  scheme  to  converge  to  within  2  cells  takes  129.74%  longer  than  the  manual  path  scheme. 
Overall,  use  of  the  information  map  allows  the  robot  to  achieve  the  lowest  mean  estimation  error 
(by  23.16%),  the  lowest  mean  entropy  (by  48.78%),  and  the  best  convergence  times  (56.47%  faster 
to  converge  to  2  cell  accuracy).  Additionally,  the  manual  path  requires  only  a  modest  number  of 
detections  per  100  meter  (about  the  same  as  the  3  simple  planners  required),  providing  low  band¬ 
width  utilization,  ffence,  we  observe  that  good  localization  is  not  strictly  how  many  detections  are 
received.  In  fact,  by  navigating  on  the  edge  of  detections,  the  robot  maintains  low  estimation  error 
and  reduces  the  number  of  detections.  The  information  map  does  not  provide  the  whole  story  either. 
The  previous  map  was  computed  with  a  prior  radius  rp  =  4  cells  implying  that  it  is  designed  for  use 
with  a  system  whose  estimation  error  lies  in  this  range.  We  will  show  that  information  maps  can  be 
useful  for  maintaining  low  error,  but  may  not  reduce  it  as  quickly  as  other  tools.  Presently,  informa¬ 
tion  maps  have  proved  themselves  as  a  useful  tool  for  sensor  network  navigation.  Encouraged  by 
this,  the  next  section  extends  the  system  models  to  be  more  realistic. 


mean  entropy  mean  entropy  mean  entropy 
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Figure  6.4:  Mean  estimation  entropy  versus  time  during  the  initial  localization  simulations. 
Captions  denote  the  path  schemes  displayed. 
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Path  Type 

Mean  Time  to  Converge  (s) 

5.66m  (8  cells) 

2.83m  (4  cells) 

1.41m  (2  cells) 

0.71m  (1  cell) 

xy 

2.33 

2.45 

— 

— 

yx 

1.46 

4.17 

— 

— 

straight 

— 

— 

— 

— 

node  centers 

1.33 

2.59 

4.48 

— 

manual 

1.36 

1.43 

1.95 

1.98 

Table  6.4:  Path  estimation  convergence  times  during  the  initial  localization  simulations. 


6.3  Relaxing  for  Continuous  Dynamics  and  Sensors 

Now  that  the  use  of  information  maps  has  been  justified,  this  section  investigates  how  to 
relax  the  system  models  to  more  closely  resemble  those  of  realistic  sensor  network  systems  such  as 
PEG.  In  this  section,  the  discrete  grid-based  robot  dynamics  from  the  previous  section  are  replaced 
with  a  continuous  space  model.  Furthermore,  the  sensor  model  is  relaxed  from  having  a  square 
detection  grid  to  having  a  continuous  circular  detection  area.  Finally,  to  be  more  consistent  with 
real-world  dynamics,  a  robot  error  model  is  introduced  and  the  prior  is  derived  from  a  normal 
distribution  rather  than  from  the  uniform  distribution  used  previously. 


6.3.1  Robot  Dynamics 

First,  the  robot  dynamics  are  relaxed  to  model  those  of  a  simple  holonomic  robot  equipped 
with  a  compass.  The  robot  is  assumed  to  be  able  to  quickly  set  its  wheel  speed  and  propel  itself  in 
any  direction.  Given  these  constraints,  the  robot’s  state  can  neglect  accelerations  and  orientation. 
More  specifically,  the  robot’s  position  in  the  field  is  represented  with  the  continuous-valued  pose 
4  e  £  c  R2  evolving  according  to  the  continuous,  normal  noise  dynamics  given  by 


4+1  -  h  +  uk  +  dk  (6.7) 

where  the  control  tq  e  U  =  [-U,  U\2  c  R2  and  the  disturbance  values  dj: ,  dj  ~  MX),  c r2d)  with  a 
noise  variance  o~d  that  is  a  function  of  the  control  Uk .  In  particular-,  for  each  component  of  the  control 
u f ,  we  let 

O" d,0  >  tT dX), mi n  |  (6-8) 

where  oy/  0,  (rdX),min  >  0.  This  enforces  that,  above  a  given  control  threshold,  the  variance  varies 
linearly  with  the  magnitude  of  the  control. 


c rd  -  max 
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Before  Markov  localization  can  be  applied  to  the  new  dynamics,  they  must  be  converted  to 
a  transition  distribution  on  the  discrete  pose  space  This  requires  that  the  probability  p{lk+\\h,  uk) 
can  be  computed  for  each  possible  discrete  transition  (Zjt+i,  4)  £  £?  and  a  given  control  uk.  Since 
the  distribution  can  be  partitioned  component-wise  as  p(4+il4>  uk)  =  p(£+]V\^  uj.  )p(l2kl  ^ \l2k,  u2).  the 
transition  computation  is  derived  for  only  one  dimension,  m  e  {1,2}.  Furthermore,  the  new  (single¬ 
dimensional)  transition  relation  is  computed  in  2  steps:  first,  the  starting  pose  /“  is  assumed  to  lie 
within  a  grid  \b™,  b™\  c  and  the  ending  pose  l’k+]  is  fixed  at  a  point  z  £  -£m;  then,  the  ending 
pose  l'k+ !  is  allowed  to  vary  across  a  grid  cell  [a'",  a'"]  c  Xm.  In  this  derivation,  when  a  pose  is  only 
known  to  be  within  a  grid  cell,  the  actual  pose  is  assumed  to  be  uniformly  distributed  across  that 
grid  cell. 

Beginning  the  derivation,  we  assume  that  I™  =  z,  IT  =  | b™ ,  b™  ] ,  and  uT  is  given.  Then, 
the  transition  distribution  is  computed  as 

/7m  _  i  mi  _  rrjn  rjn- 1 

P(!k+1  ~  Z'k  “  — 


where  Af(y;  0,  cr”)  is  the  normal  probability  distribution  function  over  y  with  mean  0  and  variance 
c r2d .  Next,  the  ending  pose  is  allowed  to  vary  across  a  cell:  letting  l'k+]  =  [a™, a™],  equation  (6.16) 


,  m  .  ini  |  ifri\ 

p{lk  +  Uk  +  dk  =  Z\lk ) 


MC  +  d?  =  z  -  u™ |ff) 


£ 


p(lk  =  w|/”!)p«  =z-u™-  w)dw 

p(J?  =  w)p(d™  =  Z  -  «*  -  vv)dvv 
rb"' 

p(dk  -  z  -  uk  -  w)dw 

Uk  °\ 

Pidk  =  y)dy 


bm 

u2 

- 

Jb» 

1 

r 

bm 

u2 

- 

K  Jz- 

1 

r 

bm 

u2 

- 

b?  Jz- 

1 

“k  2 


■Z-u"k'-b"’ 


2  (b™  -  b™) 


u’"-b™ 


erf 


AA  (y;  0,  cTd)dy 
z  -  u'l1  -  b'{ 


V2, 


(T d 


-  erf 


z  -  u'l 


V2, 


CTd 


(6.9) 

(6.10) 

(6.11) 

(6.12) 

(6.13) 

(6.14) 

(6.15) 

(6.16) 
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can  be  integrated  [39]  to  compute  p(/™+|  |/“,  u'jj)  as 


(6.17) 


=  2 oq  -  6»»)  ^(ai) '  ^(a2)  “  ^(a3)  +  ^(a4)] 

=  XT'  [£(®l)  -  ^(«2)  -  ^(«3)  +  ^(«4)] 


(6.18) 


(6.19) 


where  y  =  V2crrf,  £  is  given  by 


(6.20) 


and  the  a  functions  are  given  by 


(6.21) 


(6.22) 


(6.23) 


(6.24) 


This  provides  an  analytical  method  of  computing  the  transition  probabilities.  However, 
there  are  2  difficulties  associated  with  using  this  distribution  for  Markov  localization.  First,  since 
this  computation  is  derived  from  a  model  with  normally  distributed  noise,  the  transition  distribution 
smears  the  density  across  the  entire  field.  It  attributes  a  non- zero  probability  to  even  the  most  remote 
locations.  This  unnecessarily  slows  down  the  simulations.  To  avoid  this,  the  noise  is  truncated  in 
our  model  to  a  finite  range.  A  truncated  noise  model  is  additionally  justified  because  typically  a 
robot  does  not  experience  large  movement  errors  unless  it  encounters  an  obstacle  or  it  transported 
to  a  different  location,  neither  of  which  is  part  of  our  model.  Additionally,  for  such  an  environment, 
a  more  applicable  localization  system  such  as  multiple  hypothesis  tracking  [48,  5]  could  be  used. 
The  second  difficultly  of  using  this  transition  distribution  is  that  even  with  the  error  limited  to  a 
finite  range  this  computation  can  be  slow  in  practice.  This  issue  is  addressed  by  reformatting  the 
computation  of  the  proprioceptive  update  in  the  Markov  localization  algorithm  for  this  model. 

Henceforth,  the  noise  dlk,  dj_  is  restricted  to  the  region  [-D,  D\,  and  the  normal  distribution 
is  replaced  with  the  truncated  normal  distribution: 


0 


if  d"1  t  [~D,D] 


N(dm\  0,  cr2)  =  < 


exp(--^A-)  otherwise 


(6.25) 


,crrfV2ir 
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Algorithm  4  Continuous  Dynamics  Proprioceptive  Update 
1:  given  fjn(ln),  and  un 
2:  for  all  m  e  {1,2}  do 
3:  compute 

4-  compute  r 

s-  let  vm  =  IS'"  S'"  +  1  Sm  I 

lcl/t  l0 min’  °min  +  1’  •  •  •  >  °max> 

6:  for  all  5m  £  xm  do 

7:  compute  a  -  (S'"  +  1)A  -  u'" 

8:  compute  T™  =  ^  [£(«)  -  2£(a  -  A)  +  £(ar  -  2A)] 

9:  for  all  d1  e^d2  ex2  do 

10:  compute  0’n  (/„):  shift  /?„(/„)  by  (d1,  d2)  adding  zeros  where  necessary 

11:  compute  Pn(.ln+1 )  =  IVeU^er1  ^2^ ’^  (^«) 


where 


1 


a  = -  (6.26) 

erf  (Ply) 

Then,  to  compute  the  proprioceptive  update,  lines  2-6  of  Algorithm  3  are  removed,  and  an  updated 
computation  tailored  for  the  new  model  is  inserted  between  lines  1  and  2.  The  new  computa¬ 
tion,  listed  as  Algorithm  4,  exploits  2  properties  of  the  dynamics.  First,  the  transition  probability 
p(l-\rk,  u™)  can  be  reduced  to  a  function  of  only  the  distance  S'"  between  the  starting  and  ending 
grid  cell,  rather  than  a  function  of  the  actual  grid  cells.  Second,  since  the  control  is  known  and 
the  noise  is  bounded,  the  range  x'n  of  possible  distances  S’"  with  non- zero  transition  probability 
is  also  known  and  bounded.  More  specifically,  for  any  2  (single -dimensional)  grid  cells  separated 
by  distance  S’",  the  a  functions  in  equations  (6.21)  through  (6.24)  only  depend  on  S’".  Hence,  the 
£  function  in  equation  (6.20)  and  the  transition  distribution  in  equation  (6.19)  only  depend  on  S’", 
allowing  the  proprioceptive  update  to  be  computed  as  a  shift  and  a  scaling:  the  prior  distribution  is 
shifted  by  S'"  and  scaled  by  the  transition  probability  for  S'".  Referring  to  Algorithm  4,  the  com¬ 
putation  proceeds  by  computing  the  scaling  factor  T'”„  for  a  shift  of  S'"  in  the  m,h  dimension  (lines 
2-8).  For  each  dimension  m  e  {1,2},  lines  3  through  5  compute  the  distances^'"  that  the  robot  could 
move  with  potentially  non-zero  transition  probability.  Then,  for  each  possible  distance  S'"  e  /ym, 
lines  7  and  8  compute  the  scaling  factor  given  by  equation  (6.19).  Next,  lines  9  and  10  compute,  for 


each  2  dimensional  shift  (d1,  S2),  a  shifted  prior  distribution  ffn  (/„).  This  distribution  is  computed 


•S',62 
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as 


[3n(ln  -  (5\82))  for  (ln-(6\62))e£ 


(6.27) 


0 


otherwise 


Finally,  line  1 1  computes  the  proprioceptive  update  as  a  superimposition  of  shifted  and  scaled  priors. 

This  extra  effort  is  not  in  vain;  the  time  required  by  localization  has  been  reduced  signif¬ 
icantly.  Assuming  that  nx  =  max  {[y1 1,  \x2\],  then  an  upper  bound  on  the  time  complexity  of  this 
algorithm  is  found  to  be 


2[ci  +  C2+  HX(C3  +  c4)]  +  nxc5  +  (nx  -  l)2c6 


(6.28) 


(6.29) 


(6.30) 


Flence,  the  time  required  for  the  proprioceptive  update  is  reduced  from  O(nf)  required  by  the  Markov 
localization  algorithm.  Algorithm  3,  to  (9(/?“).  Furthermore,  assuming  that  £,  is  square,  then  nx  < 
y/nj.  Typically,  since  only  a  small  region  of  the  grid  cells  are  reachable  for  any  time  step,  nx  is  much 
smaller  than  ^/nj.  Hence,  the  time  complexity  has  been  reduced  from  O(nj)  to  something  less  than 

(9(71/). 

The  last  detail  required  by  the  new  dynamics  model  is  a  method  of  simulating  the  trun¬ 
cated  normal  distribution.  To  do  this,  the  standard  method  of  evaluating  the  inverse  of  the  cumulative 
density  function  with  a  uniformly  distributed  random  variable  is  used.  In  particular,  given  a  random 
variable  v  ~  U[0, 1]  and  the  cumulative  density  function 


•d 


(6.31) 


the  (single  dimensional)  noise  is  computed  as 


d’ii  =  r\v) 


(6.32) 


(6.33) 


where  y  =  \f2(rfj  and  a  =  1/erf  ( D/y ). 


6.3.2  Robot  Prior 


Since  the  error  in  the  robot  model  is  (approximately)  normally  distributed,  the  prior  is 
also  allowed  to  assume  an  (approximate)  normal  distribution.  For  a  normal  prior  centered  about  a 
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robot  at  Zq  e  X  with  covariance 


1 


&2p  0 
0  &2p 


P(l )  =  7 


erf 


'itj  +  A/2  -  Z; 


<5>  V 2 


,  the  prior  on  the  discrete  space  X  can  be  shown  to  be 

(6.34) 

(6.35) 


erf 

flij  +  A/2-QI 

-  erf 

pi.-A/2-ZjV 

.  o-pV2  y 

,  &p  V2  J_ 

T2\ 

0 


-  erf 


sfj  -  A/2  - 


d-p  V2 


where  gjj  e  X  is  the  center  of  cell  Z.  To  extend  this  to  a  truncated  normal,  p(l)  is  set  to  0  outside  the 
[-P,  P]2  square  centered  at  Zq  and  re-normalized. 


6.3.3  Sensor  Model 


The  binary  sensor  model  is  relaxed  by  allowing  it  to  operate  on  the  continuous-valued 
robot  pose  space  X,  and  the  detection  area  is  allowed  to  become  circular.  Then,  for  each  sampling 
period,  the  probability  of  detecting  a  robot  at  Z  e  X  for  the  kth  sensor  located  at  ap  e  X  is  given  by 
the  continuous,  circular,  binary  sensor  model'. 


p(y  =  ©10  = 


Ps 


0 


if  \\l-ak\\2<rs 
otherwise 


(6.36) 


where  rs  >  0  is  the  detection  radius.  Again,  for  Markov  localization,  this  distribution  must  be 
tessellated  to  find  p(y\l).  Similar  work  [66,  36]  has  been  done  on  discretizing  the  sensor  model 
for  ultrasonic  range  sensors.  Assuming  that  the  robot  pose  is  independent  of  the  sensing  model, 
p(J,y)  =  piy\l)p{J),  the  distribution  can  be  reduced: 


p(y\i  =  (U  f>) 


P(y\l  e  Qi.j ) 

JkgiJp(J)p(y\i) 


(6.37) 

(6.38) 


Since  the  exact  robot  pose  is  unknown  a  priori,  it  is  assumed  that  p{l)  ~  X/(X),  further  reducing  the 
distribution: 


p(y\i  =  (U  ;» 


1 

A2 


(6.39) 

(6.40) 


(6.41) 
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Figure  6.5:  Approximating  the  area  1R  used  by  the  continuous,  circular,  binary  sensor  model. 


where  A  is  the  grid  side  length  and  1R  =  Area n  {/  :  d(J,a)  <  rv(). 

The  area  1R  can  be  estimated  with  a  few  simple  approximations.  To  begin  with,  the  grid 
cell  Qij  is  by  circumscribed  by  a  circle  of  radius  e  =  -^=  as  shown  in  Figure  6.5.  We  let  gij  e  11 
be  the  center  of  the  grid  cell  Qij,  and  djjjc  be  the  distance  from  the  center  to  the  kth  mote,  that  is, 
di,j,k  =  || gij  -  ak\\.  Then,  we  delineate  the  area  computation  for  3  ranges  of  dij %k  values: 

•  dij,*  e  |0,  rs  -  e\:  Jl  =  A2 

.  duk  e  (rs  -  e,  rs  +  e]:&  =  A2  ((r-+^) 

•  dij#  e  (rs  +  e,  oo):  UK  =  0 


The  first  and  third  terms  are  obvious:  if  the  grid  cell  is  entirely  enclosed  or  entirely  outside  the 
sensing  area,  then  the  overlapping  area  1R  is  A2  or  0,  respectively.  When  Qt  j  is  partially  covered  by 
the  sensing  region,  as  a  easily  computable  approximation,  we  let  1R  vary  linearly  with  dLj^.  Now, 
using  this  in  equation  (6.41),  the  approximate  discretized  sensor  model  for  Markov  localization  is 
found  to  be 

Ps  if  0  <  ||a*  -  g||  <  rs  -  e 


p{y  =  ©|/)  w  - 


p.I(r+£-||qt-g||) 

2e 


if 


rs  -  €  <  || ak  -  g||  <  rs  +  e 


(6.42) 


0  if  rs  +  e<  \\ak-g\\ 

where  e  =  A/  V2. 

We  have  now  developed  more  realistic  models  for  the  robot,  the  sensor  network,  and  the 
robot  prior.  These  models  have  been  allowed  to  operate  in  the  continuous  space  £  and  have  been 
discretized  for  use  with  Markov  localization.  In  the  next  section,  the  new  models  are  applied  to 
several  simulations. 
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Field 

size 

100m  x  100m 

grid 

200  x  200 

sensor  distribution 

uniform 

Ns 

30 

Sensor 

model 

continuous,  circular,  binary 

rs 

5  m 

Ps 

0.90 

Estimated  Prior 

prior  model 

discretized,  truncated  normal 

&p 

0.75  m 

P 

3 &p  =  2.25  m 

Estimator 

prior  model 

discrete,  box 

rP 

50  cells 

Dynamics 

model 

continuous,  normal  noise 

U 

5  m 

o~d,  0 

0.1  m 

C  d,0,min 

0.01  m 

D 

3 (Td  =  0.3  m 

Pose 

k) 

5  10 

T 

Igoal 

70  90 

~l 

Simulation 

trials 

100 

path  type 

xy,  yx,  straight,  node  centers,  manual,  hybrid 

Table  6.5:  Systems  model  and  parameters  used  during  the  localization  simulation  with  improved 
models. 

6.4  Continuous  Simulations 

Using  the  new  sensor,  robot  dynamics,  and  prior  models  from  the  previous  section,  this 
section  runs  several  localization  simulations.  First,  the  simulation  setup  is  described.  Then,  the 
results  are  presented  and  discussed. 

The  simulation  setup  is  summarized  in  Table  6.5.  The  simulations  use  the  same  100m  by 
100m  field  with  30  sensors  uniformly  deployed  as  the  previous  section.  The  robot’s  starting  and 
ending  points  remain  the  same.  The  rest  of  the  models,  however,  have  changed.  For  the  sensor 
nodes,  the  continuous,  circular  binary  sensor  model  with  detection  radius  rs  =  5  meters  and  detec¬ 
tion  probability  ps  =  0.90  is  used.  The  estimated  prior,  used  for  calculating  the  information  map,  is 
a  truncated  normal  with  standard  deviation  frp  =  0.75  m.  The  distribution  is  truncated  at  the  3&p 
radius  which  amounts  to  a  9  cell  by  9  cell  support  square  (similar  to  previous  simulations).  The 
estimator  is  initialized  with  the  same  50  cell  radius  discrete,  box  prior.  The  new  continuous,  normal 
noise  dynamics  model  is  used  with  control  bound  U  =  5  meters  and  noise  variance  cr2  —  0.01  me- 
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100 


Figure  6.6:  Information  map  using  the  improved  models.  White  dots  outlined  in  black  represent 
motes,  and  dotted  circles  represent  their  detection  region.  Background  color  of  the  plot  represents 
the  amount  of  information. 

ters.  The  noise  remains  low  for  this  simulation  so  the  effect  the  new  models  have  on  the  simulation 
can  be  observed  independently.  Later,  a  closed-loop  control  will  be  added,  and  the  noise  variance 
will  be  increased.  The  same  path  schemes,  save  one,  are  used;  the  paths,  however,  they  generate  are 
different.  One  path  scheme,  hybrid,  is  added  to  the  line  up;  it  is  a  combination  of  2  schemes:  node 
centers  and  manual.  The  hybrid  path  initially  follows  the  node  centers  path,  but  once  the  estimation 
entropy  has  decreased  below  a  given  threshold,  it  jumps  over  to  the  manual  path  and  remains  on  this 
until  the  destination  is  achieved.  This  path  scheme  is  motivated  by  the  fact  that,  as  noted  previously, 
the  information  map  is  tuned  to  a  estimation  prior  with  supporting  radius  of  4  cells,  and  that  driving 
through  the  node  centers  quickly  reduces  large  initial  priors.  This  path  is  assumed  to  be  provided 
for  now,  but,  later  in  this  section,  the  determination  of  this  path  is  explored. 

As  before,  the  information  map  is  computed  prior  to  simulation.  The  information  map 
is  shown  in  Figure  6.6.  As  expected,  this  information  map  has  circular  high  information  regions 
as  opposed  to  the  previous  simulation  with  square  regions.  Additionally,  the  edges  of  the  high 
information  regions  tend  to  roll  off  smoothly  for  this  new  map.  This  is  due  to  the  information 
map  computation  using  a  normal  prior  with  circular  support  instead  of  a  uniform  prior  with  square 
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Path  Type 

Estimation  Error  (m) 

Estimation  Entropy 

Min 

Mean 

Max 

Min 

Mean 

Max 

xy 

2.44 

2.91 

8.22 

4.42 

4.54 

6.05 

yx 

5.08 

5.20 

5.32 

6.29 

6.30 

6.51 

straight 

3.54 

3.99 

7.29 

5.24 

5.34 

5.92 

node  centers 

1.93 

2.15 

3.99 

3.65 

3.76 

4.55 

manual 

2.16 

2.32 

2.86 

3.31 

3.38 

3.75 

hybrid 

1.52 

1.69 

2.66 

3.10 

3.21 

3.55 

Table  6.6:  Estimation  error  and  entropy  for  each  path  scheme  across  all  trials  during  the  localization 
simulations  with  improved  models. 


support.  This  property  also  provides  the  sense  that  the  high  information  regions  have  smaller  width 
than  before,  even  though  the  priors  used  similar  diameter  values. 

Once  the  information  map  is  computed,  the  localization  simulation  commences.  For  each 
path  scheme,  100  trials  are  ran,  and  various  statistics  are  recorded  .  Each  path  scheme,  typical 
estimated  path,  and  information  map  is  shown  in  Figure  6.7.  The  3  basic  paths  schemes  (xy,  yx, 
and  straight)  use  the  same  paths  as  before  (Section  6.2),  but  the  estimation  algorithm  clearly  has 
more  difficulty  in  converging  to  the  true  path.  In  particular,  the  typical  estimated  paths  do  not 
converge  to  and  track  the  actual  path  like  before.  The  3  intelligent  schemes  typically  do  much  better 
at  converging  to  the  actual  path.  Notice,  unlike  before,  the  node  centers  and  manual  path  route 
through  different  regions  of  the  field.  In  particular,  the  node  centers  path  stays  to  the  left  of  the 
field  and  goes  through  one  region  with  particularly  high  information.  The  manual  path,  on  the  other 
hand,  goes  through  this  region,  then  heads  to  the  right  and  goes  through  another  high  information 
region.  Finally,  notice  that  the  hybrid  path  is  a  combination  of  the  previous  2  routes:  initially,  it 
follows  the  node  centers  route;  then,  it  follows  the  manual  route. 

A  summary  of  the  estimation  statistics,  shown  in  Table  6.6,  looks  similar  to  the  previous 
set  of  simulations.  The  intelligent  path  planners  have  proven  themselves  capable  of  tackling  the  new 
system  models  equally  well.  The  amount  the  mean  estimation  error  changes  from  the  basic  schemes 
to  the  intelligent  schemes,  however,  is  not  as  drastic  as  before.  In  general,  all  the  path  schemes  have 
a  more  difficult  time  reducing  the  localization  error.  Additionally,  the  manual  path  has  slightly 
higher  mean  estimation  error  than  the  node  centers  path,  but  lower  estimation  entropy.  This  occurs 
due  to  the  phenomenon  mentioned  in  Section  6.2:  the  information  map  is  tuned  for  a  small  prior,  the 
system  starts  with  a  large  prior,  and  the  timing  and  location  of  initial  detections  dramatically  effects 
the  initial  convergence  of  the  estimation  error.  These  properties  are  the  inspiration  for  the  hybrid 
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(a)  xy 


(b)  yx 
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(d)  node  centers 


50  100 

(c)  straight 
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50 


50 
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(e)  manual 


(f)  hybrid 


Figure  6.7:  Basic  simulation  overview  plots  from  the  localization  simulations  with  improved  mod¬ 
els.  Depicted  is  the  robot  starting  (white  circle)  and  ending  (white  square)  positions,  the  robot  path 
(solid  black  line),  and  the  typical  estimated  robot  path  (solid  light  green  line)  superimposed  on  the 
information  map.  Each  figure  is  captioned  with  the  path  scheme  used. 
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path:  use  the  node  centers  path  to  quickly  reduce  the  initial  estimation  error,  then  use  the  manual 
path  to  maintain  a  reduced  estimation  error.  Although,  we  have  yet  to  discuss  the  development  of 
the  hybrid  path,  we  see  from  Table  6.6  that  the  hybrid  path  achieves  21.40%  less  estimation  error 
and  5.03%  less  entropy  than  the  next  best  paths,  respectively.  This  supports  the  notion  that  node 
centers  is  good  at  initializing  the  estimator  and  reducing  the  error  to  the  size  of  a  detection  region; 
whereas,  the  information  based  path  is  good  at  further  reducing  and  maintaining  low  error. 

A  closer  look  at  the  estimation  error  traces,  shown  in  Figure  6.8,  shows  more  precisely 
how  the  estimation  error  decreases  over  time.  Notice  that  the  manual  path  lags  behind  the  node 
centers  and  hybrid  path  schemes  in  reducing  the  estimation  error.  The  initial  sharp  drop-off  on  all 
the  schemes  is  due  to  when  they  (on  average)  receive  their  first  detection.  Since  the  manual  path 
tends  to  skirt  the  detection  regions,  it  takes  longer  to  receive  its  first  detections.  The  node  centers 
and  hybrid  scheme  are  as  quick  as  any  scheme  at  receiving  their  first  detection  by  driving  the  robot 
right  to  a  sensor.  Once  the  manual  path,  however,  reduces  the  estimation  error  to  levels  similar 
to  those  of  the  other  2  intelligent  schemes,  it  performs  equally  well  at  keeping  the  error  low.  In  a 
later  section,  more  noisy  simulations  will  demonstrate  that  the  manual  path  actually  does  better  at 
maintaining  low  error  than  the  other  schemes. 

A  look  at  the  estimation  entropy  traces,  shown  in  Figure  6.9,  reveals  further  distinctions 
amongst  the  path  schemes.  While  neither  node  centers  nor  manual  stand  out  as  being  the  quickest 
to  reduce  entropy,  it  is  clear  that  the  manual  path  converges  to  a  lower  entropy  value.  After  t=1.2 
seconds,  the  average  entropy  of  the  manual  path  is  about  18.5%  less  than  that  of  the  node  centers, 
providing  evidence  that  the  manual  path  tends  to  produce  a  more  confident  estimate.  Again,  here 
we  see  the  hybrid  path  provides  the  best  of  both  worlds.  Not  only  does  it  out  perform  node  cen¬ 
ter’s  estimation  error,  but  it  keeps  pace  with  and  outperforms  all  the  schemes  in  turns  of  entropy 
reduction. 

Observing  the  entropy  traces,  we  can  more  adequately  address  how  the  hybrid  path  is 
chosen.  A  entropy  threshold  value  is  chosen  for  switching  between  the  node  centers  and  manual 
paths.  Since,  the  manual  path  exploits  the  information  map,  and  the  information  map  is  tuned  to  a 
system  with  a  truncated  normal  proprioceptive  prior  with  cr2,  =  2.25  cells,  the  threshold  should  be 
set  to  switch  to  the  manual  path  when  the  actual  prior  is  similar  to  this  distribution.  The  entropy 
of  this  distribution  is  4.36,  hence,  the  hybrid  path  switches  from  node  centers  once  it  (on  average) 
achieves  an  entropy  value  of  this  or  lower.  The  node  centers  path  achieves  this  by  the  3rd  way  point 
(it  achieves  entropy  of  3.15).  So  the  hybrid  path  follows  the  first  3  way  points  of  node  centers, 
then  switches  to  the  manual  path.  In  practice,  this  switching  mechanism  does  not  have  access  to 


mean  estimation  error  (cells)  mean  estimation  error  (cells)  mean  estimation  error  (cells) 
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Figure  6.8:  Mean  estimation  error  versus  time  during  the  localization  simulations  with  improved 
models.  Captions  denote  the  path  scheme  displayed. 
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time  (s) 


(a)  xy  (solid  black),  yx  (dashed  green),  straight  (dotted  blue) 
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0  0.5  1  1.5  2  2.5  3  3.5 

time  (s) 


(b)  node  centers  (solid  black),  manual  (dashed  green),  hybrid  (dotted  blue) 
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(c)  all  path  schemes 

Figure  6.9:  Mean  estimation  entropy  versus  time  during  the  localization  simulations  with  improved 
models.  Captions  denote  the  path  scheme  displayed. 
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Path  Type 

Mean  Time  to  Converge  (s) 

5.66m  (8  cells) 

2.83m  (4  cells) 

1.41m  (2  cells) 

0.71m  (1  cell) 

xy 

0.56 

0.69 

— 

— 

yx 

1.40 

1.61 

— 

— 

straight 

0.43 

— 

— 

— 

node  centers 

0.32 

0.44 

0.87 

— 

manual 

0.41 

0.83 

1.11 

— 

hybrid 

0.33 

0.49 

0.79 

— 

Table  6.7:  Estimation  convergence  times  during  the  localization  simulations  with  improved  models. 


Path  Type 

Length  (m) 

tp(s) 

nd 

nd/ 1 00 m 

xy 

145.00 

2.80 

7.97 

5.50 

yx 

145.00 

2.80 

1.85 

1.28 

straight 

103.08 

1.60 

5.34 

5.18 

node  centers 

151.03 

2.60 

19.16 

12.69 

manual 

216.58 

3.30 

24.36 

11.25 

hybrid 

219.97 

3.40 

20.49 

9.31 

Table  6.8:  Path  information  for  each  path  scheme  during  the  localization  simulations  with  improved 
models. 


several  prior  runs  of  the  node  centers  scheme,  and,  hence,  the  switching  would  be  done  based  on 
the  run-time  value  of  entropy  (with  some  hysteresis). 

For  completeness.  Table  6.7  shows  the  estimation  error  convergence  times  for  the  path 
schemes.  This  table  reiterates  that  the  basic  path  schemes  provide  poor  and,  on  average,  unreliable 
convergence.  The  intelligent  schemes  are  all  capable  of  converging  to  within  2  grid  cells  accuracy, 
with  the  hybrid  path  providing  the  quickest  convergence  time  by  9.20%  over  the  next  fastest  scheme. 
Additionally,  notice  that  the  hybrid  scheme  sacrifices  a  minuscule  amount  of  convergence  time 
initially  to  gain  more  later. 

The  final  statistics  collected  during  the  simulations,  related  to  path  information  and  de¬ 
tections,  is  compiled  in  Table  6.8.  This  table  provides  a  comparison  of  the  paths  based  on  length, 
travel  time,  and  number  of  detections.  The  manual  path  lives  up  to  expectations  by  requiring  fewer 
detections  per  100  meters  than  the  node  centers  path;  this  is  achieved  by  riding  the  edges  of  detec¬ 
tion.  Even  though  the  hybrid  path  is  the  longest  path,  it  only  requires  a  modest  9.31  detections  for 
every  100m  of  travel.  Apparently,  the  hybrid  path  has  provided  the  best  estimation  performance  at 
the  lowest  bandwidth  utilization. 
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The  goal  of  this  work  has  been  to  learn  how  to  adapt  the  well-performing  manual  scheme 
for  simple  systems  (as  shown  in  Section  6.2)  to  a  scheme  for  real  systems.  In  particular,  care  must  be 
taken  to  use  the  information  maps  at  the  correct  time.  We  observed  that  the  node  centers  path  works 
well  for  quick  initialization  of  a  system  and  the  information  based  path  worked  well  for  the  fine- 
tuning  and  continual  maintenance  of  the  estimation  system.  This  leads  us  to  conjecture  that  a  more 
complete  information  map  path  planning  system  may  make  use  of  several  information  maps,  each 
one  tuned  to  a  different  prior  entropy  value.  Such  a  system  could  switch  amongst  the  information 
maps  as  the  online  entropy  values  change.  This  behavior  was  approximated  by  the  hybrid  scheme 
and  was  shown  to  perform  well. 

To  more  adequately  explore  the  performance  of  information  based  systems,  we  need  to 
develop  and  test  a  closed-loop  system  with  more  significant  noise  levels.  Creation  of  a  closed-loop 
system  requires  the  development  of  automatic  path  planners  that  can  rival  the  manually  chosen 
paths.  Looking  forward  enlightens  our  previous  choices.  The  reason  we  have  been  relying  on 
information  maps  with  relatively  small  priors  is  because  once  a  closed-loop  system  is  developed, 
the  entropy  will,  on  average,  be  low  and  will  stay  low  due  to  the  closed-loop  information  map  based 
path  planner.  These  notions  are  explored  in  the  next  chapter. 
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Chapter  7 

Information  Maps  and  Path  Planning 


Previous  sections  have  investigated  the  computation  of  information  maps  and  their  utility 
in  choosing  useful  paths  for  navigation.  This  section  builds  on  this  work  and  develops  several 
techniques  for  automatic  path  planning  using  node  information  and  information  maps.  First,  the 
semantics  of  entropy  and  information  and  how  they  can  be  interpreted  as  a  cost  is  discussed.  Then, 
using  the  cost  functions,  automatic  techniques  for  determining  paths  are  discussed  along  with  their 
respective  time  complexity.  Finally,  closed-loop  localization  simulations  are  performed  for  multiple 
way -points  on  multiple  maps,  and  the  results  are  compared.  By  the  close  of  this  section,  a  complete 
and  fully  functional  sensor  network  navigation  system  will  have  been  developed  and  evaluated  on  a 
set  of  realistic  system  models. 

7.1  Interpreting  Entropy 

Entropy  provides  a  measure  of  a  distribution’s  disorder  -  the  larger  the  entropy,  the  more 
scattered  a  distribution.  The  information  value  for  a  particular  cell  provides  us  with  a  measure 
of  how  much  a  prior’s  entropy  is  expected  to  change  after  sensor  readings.  Taken  together,  the 
information  map  provides  a  method  for  estimating  how  much  less  scattered  the  robot’s  estimation 
is  expected  to  be  after  sensor  readings.  However,  the  mapping  from  amount  of  scatter  to  entropy 
change  is  not  clear.  This  section  develops  the  relationship  between  entropy  and  information  and 
provides  an  interpretation  of  information  as  a  measure  of  the  expected  reduction  in  a  distribution’s 
support. 

Consider  the  relationship  between  entropy  and  cells  covered.  This  relationship  depends  on 
the  distribution  used  as  shown  in  Figure  7.1.  Notice,  the  entropy  values  for  the  normal  distribution 
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(a)  rf/2  =  0.5,  (b)  d/2  =1.5,  (c)  d/2  =  2.5,  (d)  d/2  =  4.5,  (e)  d/2  =  8.5, 

■7/  =  0.00  "77  =  2.20  "77  =  3.22  77  =  4.39  77  =  5.67 


(f)3cr  =  0.5,  (g)  3cr  =  1.5,  (h)  3c  =  2.5,  (i)  3c  =  4.5,  (j)  3c  =  8.5, 

77  =  0.34  77  =  1.62  -77  =  2.81  -77  =  4.13  77  =  5.50 

Figure  7.1:  Entropy  values  for  2  example  distributions  with  increasing  sizes  on  a  29x29  grid  with 
the  mean  at  cell  (15, 15).  The  top  plots  are  of  uniform  distributions  over  a  d  by  d  square;  the  bottom 
plots  are  of  a  normal  distribution  with  given  standard  deviation.  All  plots  list  the  entropy  value  77. 


are  smaller  than  those  for  the  uniform  distribution.  This  is  always  the  case:  the  uniform  distribution 
provides  an  upper  bound.  A  uniform  distribution  covering  n  cells,  has  a  probability  density  of  p  =  \ 
for  each  cell.  The  entropy  of  this  distribution  is  77  =  -£„/?  log(p)  =  log(n).  Notice,  the  logarithmic 
progression  of  the  entropy  versus  the  area  from  left  to  right  in  Figure  7.1.  Given  an  entropy  value 
77  derived  from  a  uniform  distribution,  the  number  of  cells  covered  by  the  distribution  is  given  by 
n  -  eH .  Furthermore,  assuming  a  uniform  distribution  with  entropy  Hq  (covering  no  =  eH°  cells)  is 
reduced  to  a  uniform  distribution  with  entropy  Hi  (covering  n  \  =  eH>  cells),  the  decrease  in  support 
is  A n  =  no  -  ni  =  eH°  -  eH] .  For  the  information  map  computation,  where  a  prior  with  constant 
entropy  Ho  is  used,  it  is  convenient  to  rewrite  the  change  in  support  as 

An  =  eH°  -  eH>  =  eH°  -  e //o_/  =  eH°(l  -  e~:)  (7. 1) 

where  we  have  used  that  I  =  Hq  -  H\ .  This  term  is  derived  for  uniform  distributions,  but  can  be 
used  to  aid  us  in  interpreting  information  for  other  distributions  too. 

Given  this  relationship  between  information  and  support,  overall  cost  or  utility  can  be 
interpreted  in  many  ways.  We  present  2  methods.  First,  the  cost  k  could  be  an  affine  transformation 
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of  the  change  in  support,  such  as 


k  -  Co  -  ci(An) 

(7.2) 

1 

1 

CT 

i 

o 

II 

(7.3) 

=  Co  +  C2(e  1  -  1) 

(7.4) 

where  cq,  c\,  c2  >0  and  c o  is  chosen  largest  enough  such  that  k  >  0.  This  representation  assigns  low 

costs  to  large  reductions  in  number  of  cells  involved  in  the  support. 

be  used  to  represent  the  increase  in  support  diameter,  such  as 

Alternatively,  the  cost  k  could 

k  -  cq-  ci(  s/no  -  Vffi) 

(7.5) 

=  Co  -  Cl  VPb(l  -  VP) 

(7.6) 

=  Co  +  C2(VP-  1) 

(7.7) 

where  co,  ci,  c2  >0  and  c o  is  chosen  largest  enough  such  that  k  >  0.  This  representation  assigns  low 
costs  to  large  reductions  in  the  diameter  of  the  support.  Unless  stated  otherwise,  the  cost  k  always 
refers  to  the  former  affine  cost  function.  The  next  section  uses  the  cost  interpretation  to  develop 
path  planners. 

7.2  Automating  Path  Planning 

Up  to  this  point,  a  complete  closed-loop  navigation  system  sans  a  path  planner  has  been 
developed.  This  section  fills  in  the  gap  by  developing  several  path  planners,  some  of  which  have 
already  been  used  in  previous  simulations  without  explanation.  An  overview  of  each  path  planner 
along  with  running  time  analysis  and  example  paths  is  presented. 

The  previously  used  simple  path  planners  (xy,  yx,  and  straight)  and  obvious  and  fixed; 
hence,  the  following  investigation  focuses  on  only  the  intelligent  path  planners.  Three  such  plan¬ 
ners  are  discussed:  information  grid,  node  centers,  and  information  clusters.  The  general  modus 
operandi  of  each  of  these  planners  is  the  same:  encode  the  space  £.  and  any  path  information  into  a 
weighted,  directed  graph;  then,  apply  a  least  cost  optimization  routine  to  generate  a  path  in  The 
difference  in  each  planner  is  the  manner  in  which  the  graph  is  constructed.  For  the  optimization 
routine,  many  tools  exists  [25,  80],  such  as  Dijkstra,  A*,  or  Bellman-Ford.  To  avoid  path  planners 
that  create  loops  or  back-track  significantly,  the  graph  edge  weights  are  restricted  to  positive  values. 
Flence,  generalized  dynamic  programming  routines  like  Bellman-Ford  are  not  required.  Addition- 


94 


Figure  7.2:  Paths  generated  by  the  information  grid  path  planner  {cK  =  1  and  cd  =  10)  for  3  different 
mote  topologies. 

ally,  since  the  primary  focus  is  not  efficiency,  but  path  effectiveness,  heuristic  based  routines  such 
as  A*  are  ruled  out.  For  our  work,  the  Dijkstra  algorithm  is  used  for  finding  the  lowest  cost  path. 

The  Dijkstra  algorithm  requires  a  weighted,  directed  graph  G  with  vertices  V  and  edges  E 
such  that,  for  each  edge  e  =  ( u ,  v)  e  E,  the  cost  of  going  from  u  to  v  is  given  by  w(u,  v )  defined  by  the 
positive  cost  function  w  :  E  — *  [0,  oo).  Given  starting  and  ending  vertices  uq,  Uf  e  V,  the  algorithm 
computes  the  lowest  cost  path  P  e  V*,  where  V*  is  the  set  of  all  finite  sequences  of  vertices.  For 
general  graphs,  the  time  complexity  of  this  algorithm  is  0(rfy  where  nv  =  |V|. 

The  most  straightforward  application  of  the  Dijkstra  algorithm  is  realized  by  the  informa¬ 
tion  grid  path  planner.  This  path  planner  encodes  the  entire  information  map  in  a  graph.  A  similar 
path  planer  [78]  has  been  previously  developed  that  additionally  incorporates  an  obstacle  map.  In 
our  approach,  the  set  of  vertices  for  the  graph  is  defined  to  be  the  set  of  all  grid  cells  {Qij}-  Then, 
each  grid  cell  (vertex)  is  defined  to  have  an  edge  connecting  to  the  immediately  adjacent  grid  cells 
(vertices).  Finally,  the  overall  cost  to  go  from  vertex  u  =  gij  to  vertex  v  =  g^j  is  defined  as  a  linear 
combination  of  the  ending  cell’s  information  cost  k(v )  =  Ay/;./)  and  the  2-norm  distance  between 
the  cells’  centers: 

w(u,  v)  =  ckk(v)  +  cd\\u  -  v\\2  (7.8) 

where  the  mixing  proportions  cK,  cd  are  non-negative.  This  function  assigns  far  away  cells  with  low 
information  a  high  cost.  Figure  7.2  shows  the  path  generated  by  the  information  grid  path  planner 
for  3  different  node  configurations.  For  each  plot,  the  field  spans  100  meters  by  100  meters,  and  the 
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Figure  7.3:  Paths  generated  by  the  node  centers  path  planner  for  3  different  mote  topologies. 


robot  starts  in  the  lower  left  at 


.  These  plots  have  used 


5  10  and  ends  in  the  upper  right  at  70  90 
cK  =  1  and  Cd  =  10.  It  turns  out,  for  more  general  values  of  cK  and  Cd,  that  this  path  planner  tends  to 
choose  fairly  straight  paths  deviating  only  slightly  to  encounter  higher  information  regions.  Later 
simulations  will  demonstrate  that  the  paths  generated  by  this  planner  tend  to  reduce  travel  time  to 
the  destination  by  sacrificing  some  reduction  in  estimation  error.  A  disadvantage  of  this  planner  is 
the  computation  time  required.  Since  it  uses  all  the  grid  cells  as  vertices,  the  number  of  vertices 
is  nv  =  ti/,  inducing  a  O(nj)  running  time.  The  remaining  path  planners  that  are  presented  require 
significantly  less  time. 

The  node  centers  path  planner  embodies  the  idea  that  the  robot  should  drive  through  many 
detection  centers  on  the  way  to  the  destination.  This  planner  uses  the  set  of  mote  locations  and  the 
robot’s  starting  and  ending  positions  as  the  graph’s  vertices  for  the  Dijkstra  algorithm.  It  creates 
edges  amongst  all  mote  locations  and  robot  starting  and  ending  positions,  and  encodes  the  cost 
from  u  to  v  as  the  exponential  of  the  2-norm  distance: 

w{u,  v)  =  e11"-1’1'2  (7.9) 


The  advantage  to  using  an  exponential  in  the  cost  function  is  that  it  encourages  the  planner  to 
take  many  small  leaps  towards  the  destination  (encountering  several  nodes);  whereas,  without  the 
exponential,  the  lowest  cost  path  would  be  the  straight  line  connecting  the  starting  and  destination 
positions.  Three  runs  of  the  node  centers  planner  are  shown  in  Figure  7.3.  The  chosen  paths  tend 
to  find  routes  supported  by  many  close  detection  areas.  The  planner  will  prefer  a  path  along  many 
close  motes  to  a  path  that  take  large  jumps  in  between  groups  of  motes  as  shown  by  the  middle  plot. 


96 


Figure  7.4:  Paths  generated  by  the  information  clusters  path  planner  (nv  =  100  and  dp  =  5)  for  3 
different  mote  topologies. 

Finally,  since  this  planner  only  uses  the  mote  locations  and  2  robot  positions  as  vertices,  the  running 
time  is  O(N^)  where  Ns  is  the  number  of  motes  and  is  typically  much  less  that  «/. 

The  final  path  planner,  information  clusters,  combines  several  of  the  previous  ideas.  It  at¬ 
tempts  to  generate  a  path  that  prefers  many  small  leaps  (like  node  centers)  favoring  high  information 
regions  (like  information  grid).  This  planner  uses  2  parameters:  the  number  of  candidate  vertices  nv 
and  the  padding  dp  around  each  vertex.  Both  these  parameters  effect  how  candidate  vertices  for  the 
Dijkstra  algorithm  are  chosen.  More  specifically,  starting  with  an  empty  set  of  vertices,  the  planner 
finds  the  grid  cell  u  with  the  lowest  cost  k{u),  and  adds  this  grid  cell  to  the  set  of  vertices.  Then,  all 
grid  cells  within  dp  (according  to  the  norm  on  the  partition  space  £,)  are  removed  as  possible  vertex 
candidates.  This  procedure  now  repeats  by  choosing  the  next  lowest  cost  vertex,  adding  it  to  the 
set  of  vertices,  and  removing  neighboring  cells.  The  set  of  vertices  is  increased  until  nv  have  been 
identified.  Then,  each  vertex  is  connected  to  every  other  vertex,  and  the  edge  cost  of  u  to  v  is  the 
same  exponential  distance  used  previously: 

w(u,v)  =  ellu~vh  (7.10) 

Examples  of  this  planner  for  nv  =  100  and  dp  =  5  arc  shown  in  Figure  7.4.  The  paths  generated 
are  similar  to  the  node  centers  paths,  but  instead  of  routing  through  a  node  center,  these  paths  tend 
to  ride  the  edge  of  detection.  In  fact,  the  paths  produced  by  this  planner  are  quite  similar  to  the 
manual  paths  used  in  Sections  6.2  and  6.4.  Hence,  a  hybrid  planner  exploiting  node  centers  and 
information  clusters  may  provide  results  superior  to  each  planner  alone  as  demonstrated  previously 
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Figure  7.5:  Paths  generated  by  the  information  clusters  path  planner  for  a  fixed  mote  topology.  The 
padding  is  fixed  at  dp  =  8  cells,  and  nv  varies  across  the  set  {25, 50, 100}  from  left  to  right. 

with  a  hybrid  manual  and  node  centers  path.  Overall,  this  algorithm  has  the  advantage  of  choosing 
intermediate  way -points  in  high  information  regions  and  choosing  paths  with  many  short  jumps 
between  way-points.  Finally,  the  time  required  for  this  algorithm  is  0(n~,),  and,  hence,  is  set  by  the 
nv  parameter  which  is  typically  much  less  than  «/. 

One  potential  disadvantage  of  the  information  clusters  planner  is  the  need  to  choose  2 
parameters  nv  and  dp  for  a  particular  deployment.  To  better  understand  these  parameters,  we  observe 
the  effect  varying  them  has  on  the  generated  paths.  In  Figure  7.5,  the  padding  has  been  fixed  at 
dp  =  8  cells,  and  the  number  of  vertices  nv  is  varied  across  the  set  {25, 50, 100}  from  left  to  right. 
Increasing  the  number  of  vertices  provides  the  planner  with  more  way-points  to  choose  from.  Since 
the  padding  is  fixed,  additional  way-points  are  chosen  far  from  previous  ones.  This  tends  to  make 
the  generated  path  more  jagged:  it  takes  more  frequent  short  jumps.  Typically,  this  generates  a  path 
that  more  precisely  follows  the  high  information  regions. 

In  Figure  7.6,  the  number  of  candidate  vertices  is  fixed  at  nv  =  50,  and  the  padding  dp  is 
varied  across  the  set  {2,4,8}  from  left  to  right.  Increasing  the  padding  forces  the  candidate  way- 
points  to  be  more  spread  out.  As  shown  in  the  plots,  this  tends  to  generate  paths  that  deviate  farther 
from  a  straight  path  than  paths  with  less  padding.  However,  if  padding  is  increased  more,  the  gener¬ 
ated  paths  may  adhere  closer  to  straight  paths:  since  candidate  way-points  are  pushed  far  away  from 
a  good  route,  the  path  tends  to  use  fewer  points  creating  a  straighter  line.  In  the  following,  we  will 
denote  the  information  clusters  planner  configured  with  nv  vertices  and  dp  padding  as  information 
clusters  (nv,  dp). 
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Figure  7.6:  Paths  generated  by  the  information  clusters  path  planner  for  a  fixed  mote  topology.  The 
number  of  candidate  vertices  is  fixed  at  nv  -  50,  and  the  padding  dp  varies  across  the  set  {2,4,  8} 
from  left  to  right. 

7.3  Closed-Loop  Performance 

Using  the  path  planners  developed  in  the  previous  section,  this  section  performs  several 
localization  simulations.  In  particular,  the  navigation  system  can  now  be  tested  in  a  closed-loop  con¬ 
figuration  for  several  scenarios.  This  section  introduces  the  localization  simulation  setup,  performs 
a  set  of  closed-loop  simulations,  and  analyzes  the  results. 

To  fully  test  the  navigation  system,  the  localization  simulations  tests  each  path  planner 
in  a  closed-loop  configuration  for  multiple  way-points  and  for  multiple  sensor  network  topologies. 
In  particular,  the  simulator  first  randomly  generates  10  different  sensor  network  topologies.  For 
completeness,  these  topologies  and  accompanying  information  maps  are  listed  in  Appendix  B.  Each 
topology  is  generated  by  uniformly  distributing  30  motes  on  the  100m  by  100m  field.  Then,  for  each 
topology,  a  set  of  mini-experiments  are  run.  For  each  mini-experiment,  the  system  configuration  and 
parameters  used  are  listed  in  Table  7.1.  The  improved  models  for  the  sensor,  dynamics,  and  prior 
developed  in  Section  6.3  are  utilized.  The  sensor  is  configured  to  have  a  90%  chance  of  detecting 
an  object  within  5  meters.  The  information  maps  are  computed  using  the  truncated  normal  prior 
with  a  (single-dimensional)  standard  deviation  of  0.75  m.  This  prior  is  truncated  at  the  3crp  radius 
creating  a  9  cell  by  9  cell  square  support.  The  initial  estimation  of  the  robot’s  location  is  given 
by  a  uniform  distribution  covering  a  21  by  21  square  of  cells.  Previous  simulations  used  an  initial 
prior  with  a  much  larger  diameter  (101  cells)  since  the  focus  was  on  estimation  convergence.  For 
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Field 

size 

100m  x  100m 

grid 

200  x  200 

sensor  distribution 

uniform 

Ns 

30 

Sensor 

model 

continuous,  circular,  binary 

r.s 

5m 

Ps 

0.90 

Estimated  Prior 

prior  model 

discretized,  truncated  normal 

frP 

0.75  m 

P 

3<Xp  =  2.25  m 

Estimator 

prior  model 

discrete,  box 

rP 

10  cells 

Dynamics 

model 

continuous,  normal  noise 

U 

5  m 

Cdfl 

0.84  m 

O'  d,0,min 

0.05  m 

D 

3  crd  =  2.5 1  m 

Pose 

/  /I  ;2  /3 

O’ 1 goal ’  lgoaP  Lgoal 

5  70  80  12 

10  90  22  83 

Simulation 

trials 

100 

path  planners 

xy,  yx,  straight,  node  centers, 
information  grid,  information  clusters 

Table  7.1:  System  models  and  parameters  used  for  the  closed- loop,  multiple  map  localization  sim¬ 
ulations. 
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the  current  simulations,  a  smaller  prior  is  used  and  the  focus  is  shifted  to  how  well  the  closed-loop 
system  can  maintain  a  tight  estimate  and  closely  follow  a  desired  path.  The  continuous,  normal 
noise  model  of  the  dynamics  is  used  with  5  meter  control  bounds  and  a  noise  variance  of  0.70  m2. 
Again,  since  these  experiments  focus  on  closed-loop  performance,  the  noise  variance  is  increased 
to  0.70  m2  from  previous  experiments  that  used  a  minimal  noise  variance  of  0.01  m2.  For  each  trial, 
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(upper,  left  corner). 

Each  mini-experiment  compares  9  different  path  planners.  Each  planner  is  depicted  in 
Figure  7.7  with  the  robot  path  and  estimated  path  for  a  typical  run  of  the  first  mini-experiment.  The 
top  3  plots  show  the  basic  path  planners  tested:  xy,  yx,  and  straight.  The  remaining  planners  are  in¬ 
telligent  planners:  a  node  center,  an  information  grid,  and  4  different  configurations  of  information 
clusters.  Clearly,  the  paths  chosen  vary  heavily  depending  on  the  planner  and  the  parameters.  An 
interesting  characteristic  of  the  intelligent  planners  is  that  sections  of  previous  paths  are  frequently 
reused  for  multiple  way-point  paths.  For  example,  using  the  information  clusters  planner  in  Fig¬ 
ure  7.7i,  when  the  robot  navigates  from  the  lower,  right  corner  to  the  upper,  left  corner,  much  of 
the  previous  path  is  reused.  This  leads  us  to  conjecture  that  a  deployed  sensor  network  implicitly 
creates  several  stable,  static  routes  that  are  consistently  useful  for  localizing  a  robot.  Such  routes 
could  prove  useful  for  a  variety  of  reasons:  the  path  planner  can  reduce  the  computation  required  by 
reusing  paths  or  several  unused  motes  in  remote  regions  could  be  re-deployed  next  to  a  heavy  use 
route. 


For  each  path,  resulting  statistics  are  first  averaged  for  each  mini-experiment,  and  then 
across  mini-experiments.  Additionally,  statistics  are  collected  after  the  first  way-point  has  been 
reached  providing  time  for  the  system  to  settle.  Several  key  results  from  the  simulations  are  shown 
in  Table  7.2.  Notice  that  the  basic  planners  are  challenged  to  reduce  the  mean  estimation  error 
below  1.60  meters.  Additionally,  the  most  basic  intelligent  path  planner,  node  centers,  improves  the 
estimation  performance  over  the  best  performing  basic  planner,  straight,  by  4.40%.  The  remaining 
intelligent  planners  progressively  push  the  mean  estimation  error  down  to  the  1.3  meter  range  and 
outperform  the  best  simple  path  planner  by  anywhere  from  12.58%  to  19.50%.  Next,  notice  that 
the  node  centers  planner  generates  a  lengthy  path  (49.06%  longer  than  the  shortest  path  planner, 
straight)  with  a  high  number  of  detections  per  100  meters  (24.34%  more  detections  than  the  next 
largest  value).  Hence,  this  path  planner,  which  ignores  the  information  map,  tends  to  produce  paths 
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(a)  xy 


(b)  yx 


(c)  straight 


(d)  node  centers 


(e)  information  grid  (f)  information  clusters  (15,1) 


Figure  7.7:  Basic  simulation  overview  plots  for  the  multiple  way-point,  multiple  map  localization 
simulations.  Depicted  is  the  desired  robot  path  (thick  solid  red  line),  the  actual  robot  path  (solid 
black  line),  and  the  typical  estimated  robot  path  (solid  light  green  line)  superimposed  on  the  infor¬ 
mation  map.  Each  figure  is  captioned  with  the  path  planner  and  parameters  used. 
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path 

mean 

length  (m) 

nd 

nd/ 100 m 

mean 

estimation 
error  (m) 

mean 

entropy 

xy 

352.00 

24.75 

7.03 

1.67 

4.40 

yx 

352.00 

22.31 

6.34 

1.64 

4.40 

straight 

263.16 

23.14 

8.79 

1.59 

4.43 

node  centers 

392.27 

66.82 

17.06 

1.52 

4.35 

information  grid 

272.33 

26.14 

9.60 

1.39 

4.13 

information  clusters(15,l) 

337.12 

42.15 

12.52 

1.35 

4.20 

information  clusters(60,12) 

458.88 

63.14 

13.72 

1.32 

4.07 

information  clusters(40,15) 

429.28 

56.48 

13.16 

1.35 

4.18 

information  clusters(  100,5) 

461.18 

60.52 

13.13 

1.28 

3.90 

Table  7.2:  Simulation  statistics  collected  for  each  path  planner  running  a  closed-loop  localization 
simulation  across  10  node  topologies  with  100  trials  each. 


which  provide  little  apparent  benefit  over  simple  path  planners.  Other  intelligent  planners,  however, 
exploiting  the  information  map,  tend  to  generate  more  suitable  paths.  For  instance,  the  information 
grid  planner  not  only  generates  paths  with  lengths  that  rival  the  straight  path  planner  (only  3.48% 
longer)  and  reduces  detections  per  100  meters  to  a  modest  level  (only  9.22%  more  than  the  best 
performing  basic  path  planner),  but  it  also  reduces  the  estimation  error  significantly  (anywhere 
from  12.58%  to  16.77%  over  the  basic  schemes).  Such  a  planner  would  likely  be  preferred  to  the 
basic  planners  in  most  scenarios. 

The  information  clusters  planners,  as  a  group,  produce  the  least  estimation  error.  They 
also  tend  to  produce  some  of  the  longest  paths  with  a  high  number  of  detections  per  meter,  though 
still  fewer  detections  than  required  by  the  node  centers  planner.  These  planners  tend  to  sacrifice 
path  length  and  number  of  detections  for  reduced  estimation  error  and  computation  time.  Finally, 
the  closed-loop  performance  provides  more  support  for  the  information  based  planners.  Due  to  the 
accuracy  of  the  estimation,  the  basic  path  schemes  and  the  node  centers  scheme  tend  to  track  the 
desired  path  to  within  about  1.03  meters;  however,  the  other  planners  track  the  desired  path  within 
about  0.85  meters  (a  17.48%  improvement). 

This  chapter  began  with  providing  a  more  tangible  interpretation  of  information  as  cost. 
Using  this  interpretation,  several  path  planners  were  developed  that  generated  varying  paths  de¬ 
pending  on  the  mote  topology.  Finally,  several  sets  of  closed-loop  simulations  were  performed  to 
demonstrate  the  benefit  of  information  based  planners.  In  particular,  it  was  demonstrated  that  2 
varieties  of  information  based  planners  provided  improved  performance  over  all  the  simple  path 
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schemes  and  even  the  node  centers  planner  which  exploits  node  location  information.  The  next 
chapter  provides  an  overview  and  summary  of  our  research. 
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Chapter  8 

Conclusions 


Our  work  focused  on  how  to  develop  control  and  navigation  systems  that  operate  effec¬ 
tively  when  embedded  with  real  world  wireless  sensor  networks  (WSNs).  Such  networks  are  com¬ 
posed  of  many  small  sensing  and  computation  nodes,  or  motes.  Often  times,  each  mote  executes  a 
variety  of  services  such  as  time  synchronization,  localization,  entity  detection,  leader  election,  and 
message  communication.  The  composite  system  provides  not  only  a  complex  distributed  sensor, 
but  also  an  interactive  and  variable  sensing  mechanisms.  To  further  our  understanding  of  control 
system  design  within  such  an  environment,  we  began  by  deploying  several  real  world  sensor  net¬ 
works.  From  this  experience,  we  extracted  models  of  the  sensing  platform  and  developed  practical 
techniques  for  architecting  control  systems.  Next,  using  robot  navigation  as  our  benchmark,  we  de¬ 
veloped  a  simulator  and  tested  several  control  system  designs.  This  work  identified  intelligent  path 
planning  as  an  important  method  for  improving  localization  and  navigation  in  sensor  networks.  Us¬ 
ing  an  information  topology,  we  developed  several  path  planning  techniques,  and  demonstrated  that 
these  techniques  could  reduce  bandwidth  utilization  and  improve  localization  accuracy. 

One  of  the  initial  sensor  network  and  control  (SNAC)  systems  that  we  deployed  is  de¬ 
scribed  in  Chapter  2.  A  grid  of  121  motes  was  deployed  in  an  outdoor  field  for  playing  mobile 
robot  pursuit-evasion  games  (PEGs).  This  chapter  began  by  discussing  the  general  sensing  and 
communication  algorithm  implemented  by  the  sensor  network.  These  details  allowed  the  reader 
to  understand  the  complexity  of  using  WSNs  as  a  distributed  sensor.  Additionally,  the  control  and 
navigation  system  utilized  by  the  pursuer  robot  was  presented.  This  system  relied  on  2  sensors:  the 
sensor  network  and  an  on-board  GPS  sensor.  The  GPS  sensor  was  necessary  to  provide  reliable 
estimates  of  the  pursuer  location.  Without  such  estimates,  it  was  unclear  how  the  pursuit  algorithm 
should  be  designed. 
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Once  deployed,  we  turned  to  characterizing  the  performance  and  results  of  this  system. 
In  particular,  PEG  was  deemed  a  successful  deployment  with  the  pursuer  robot  capturing  the  evader 
in  all  runs.  However,  the  performance  of  the  sensor  network  was  less  than  expected.  An  analogous 
deployment  indicated  that  detections  from  the  sensor  network  arrived  every  3.02  seconds,  with  a 
latency  of  1.75  seconds,  and  an  detection  error  of  2.42  meters.  After  the  application  of  several 
idealistic  filtering  techniques  (zero  movement  noise  Kalman  filtering,  latency  removal,  and  faulty 
detection  removal),  the  final  estimation  error  could  be  reduced  to  1.53  meters.  We  noted  that  these 
characteristics  are  dramatically  different  from  the  GPS  sensor  which  provides  robot  position  esti¬ 
mation  at  10  Hz  with  0.02  meters  accuracy.  Hence,  we  identified  the  primary  distinction  between  a 
high  performance  local  sensor  (GPS)  and  a  large-scale  distributed  sensor  (WSN).  In  order  to  facili¬ 
tate  the  design  of  a  control  system  informed  solely  by  a  WSN,  we  identified  several  characteristics 
of  sensor  networks  that  challenge  the  performance  of  traditional  control  systems.  These  challenging 
characteristics  were  grouped  into  3  categories:  sensor  error,  false  events,  and  network  induced  error. 

Using  our  experience  from  PEG  and  the  list  of  challenging  characteristics,  the  next  chap¬ 
ter,  Chapter  3,  developed  a  general  control  architecture.  In  particular,  for  each  challenge,  a  list  of 
services  useful  for  combating  each  challenge  was  developed.  In  total,  several  services  were  iden¬ 
tified:  predictive  control,  neighborhood  model  based  estimation,  intelligent  path  planning,  multi¬ 
modal  control  and  networking,  sensing  coordination,  time  synchronization,  hand  shaking,  and  mote 
maintenance.  Using  these  services,  a  unified  architecture  for  both  the  control  system  and  the  motes 
was  developed.  We  noted  that  viewing  the  control  architecture  as  a  service  based  architecture  has 
the  advantage  of  freeing  the  designer  from  the  traditional  control  system  paradigm.  More  specifi¬ 
cally,  a  service  based  architecture  allows  the  designer  to  create  services  that  interact  and  vary  the 
sensing  system. 

After  enumerating  both  a  set  of  challenges  and  a  set  of  solutions,  Chapter  4  began  our 
development  of  a  simulator  for  further  investigating  SNAC  systems.  Using  our  previous  experi¬ 
ence  and  results  from  the  literature,  models  were  developed  for  the  sensor  network  that  account 
for  many  properties  of  real  world  deployments:  sensing  noise,  sensor  saturation,  calibration  dif¬ 
ferences,  packet  collision,  radio  reception  range,  multi-hop  latency,  finite  battery  lifetime,  faulty 
hardware,  and  loose  time  synchronization.  More  specifically,  we  developed  a  sensor  model  based 
on  the  magnetometer  used  during  the  PEG  deployment.  Objects  are  detected  in  accordance  with 
a  magnetic  dipole  far  field  model.  Additionally,  the  operational  status  of  the  sensor  hardware  was 
modeled  with  a  state  machine  permitting  several  states:  normal,  saturation,  noisy,  unresponsive,  and 
unpredictable.  Next,  we  developed  a  communication  layer  model  that  operates  in  2  states:  broadcast 
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and  multi-hop.  The  broadcast  communication  model  probabilistically  accounted  for  MAC  delays, 
packet  collision,  and  reception  ranges.  The  multi-hop  communication  model  accounted  for  packet 
loss  and  cumulative  packet  latency.  Next,  we  developed  a  mote  platform  model  accounting  for  a 
mote’s  finite  lifetime  and  individual  time  synchronization.  Finally,  we  modeled  the  detection  rou¬ 
tine  used  during  the  PEG  trials.  In  addition  to  our  WSN  models,  we  also  developed  a  robot  model 
based  on  car  like  kinematics. 

After  developing  our  system  models,  we  instantiated  control  and  estimation  services  from 
our  unified  control  architecture.  For  the  controller,  2  distinct  way-point  navigation  services  were 
developed:  feedback  and  predictive  control.  For  a  basic  estimation  service,  an  extended  Kalman  fil¬ 
ter  was  designed.  Two  additional  augmentations  to  the  estimator  were  developed:  network  latency 
compensation  and  faulty  node  filtering.  Using  the  aforementioned  models  and  the  newly  developed 
control  and  estimation  services,  several  simulations  were  ran.  First,  we  ran  a  set  of  simulations  to 
compare  the  performance  of  the  control  and  estimation  services.  These  simulations  reinforced  the 
difficulty  of  applying  basic  feedback  control  to  a  SNAC  system.  Additionally,  we  demonstrated  that 
our  estimation  and  control  architecture  was  able  to  achieve  the  best  navigation  results:  the  goal  was 
more  frequently  achieved  and  the  overall  estimation  error  was  less.  Hence,  our  control  architec¬ 
ture  utilizing  model  predictive  control  and  an  estimation  system  tailored  to  the  sensor  network  was 
more  effective  for  way -point  navigation  than  traditional  control  techniques.  Next,  to  understand  the 
relationship  between  a  robot’s  path  and  the  localization  accuracy,  we  ran  several  simulations  with 
varying  robot  paths.  In  particular,  some  paths  were  chosen  to  lead  the  robot  through  areas  densely 
covered  in  sensors  with  the  intention  to  decrease  the  estimation  error.  These  simulations  revealed 
the  complex  nature  of  navigation  in  sensor  networks.  Our  results  indicated  that  path  planning  can 
significantly  effect  the  number  of  detections,  the  detection  error,  and  the  overall  estimation  error. 
However,  it  was  not  clear  how  a  path  should  be  planned  to  consistently  improve  estimation  accu¬ 
racy.  We  observed  that  paths  through  densely  covered  areas  produced  both  more  and  less  detection 
error  than  our  baseline,  straight  path.  Additionally,  such  paths  consistently  decreased  the  position 
estimation  error,  but  not  the  orientation  estimation  error. 

Intrigued  by  these  results,  in  Chapter  5,  we  began  to  formalize  the  relationship  between 
path  and  localization  accuracy.  We  began  by  reviewing  Markov  localization  and  information.  Next, 
we  formalized  the  computation  of  an  information  map,  with  particular  attention  being  paid  to  the 
time  complexity.  For  situations  where  the  computation  of  the  sensor  model  and  prior  model  require 
constant  time,  we  showed  that  the  information  map  computation  has  a  time  complexity  of  0(njny) 
where  iy  is  the  cardinality  of  the  pose  space  and  ny  is  the  cardinality  of  the  measurement  space.  Not- 
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ing  that  this  time  complexity  can  be  limiting  in  practice,  we  reviewed  an  approximate  information 
map  computation  algorithm  developed  by  Roy  et  al  [78].  This  algorithm  was  shown  to  reduce  the 
time  complexity  by  computing  across  a  reduced  pose  and  measurement  space.  Next,  we  developed 
a  sensor  model  and  a  robot  prior  model  and  computed  several  information  maps.  In  particular,  we 
introduced  the  discrete  binary  sensor  model  and  the  discrete  box  prior  model.  Using  these  models 
and  a  uniformly  deployed  sensor  network,  we  varied  the  size  of  the  prior  and  computed  several 
information  maps.  We  discussed  the  time  required  to  compute  these  maps  on  modern  hardware  and 
discussed  how  the  information  map  could  be  updated  in  real  time  to  compensate  for  sensor  nodes 
coming  on-line  or  going  off-line.  Additionally,  we  noted  that  regions  both  far  from  sensing  areas 
and  close  to  node  locations  were  low  in  information;  whereas,  regions  close  to  the  edge  of  detec¬ 
tion  were  high  in  information.  We  noted  that  the  information  topology  explains  why  a  path  planner 
developed  to  simply  drive  in  densely  covered  regions  or  close  to  nodes  may  fail  to  provide  the  best 
localization  results. 

Verifying  the  error  reducing  ability  of  information  based  paths  with  simulations  was  car¬ 
ried  out  in  Chapter  6.  First,  Markov  localization  was  formalized  as  an  algorithm.  We  noted  the  time 
complexity  for  this  computation:  0(njti  +  n/ts)  where  tj  and  ts  are  the  times  required  to  compute 
the  transition  distribution  and  the  sensor  model,  respectively.  Next,  we  introduced  the  basic  discrete 
adjacent  cell  dynamics  for  robot  movement  and,  using  these  dynamics,  ran  several  localization  sim¬ 
ulations  for  a  set  of  fixed  paths.  In  particular,  we  compared  5  paths:  3  basic  paths  (xy,  yx,  straight), 
a  node  by  node  path  (node  centers),  and  a  manually  chosen  information  based  path  (manual).  It 
was  demonstrated  that  the  information  based  path  achieves  the  lowest  mean  estimation  error  and 
mean  entropy  along  its  path,  while  requiring  only  a  modest  number  of  detections.  Furthermore,  we 
demonstrated  that  the  node  centers  path  achieves  the  next  lowest  error,  but  requires  210.00%  more 
detections.  Hence,  it  was  shown  that  the  simple  and  intuitive  path  planning  scheme,  to  go  node  by 
node,  is  far  from  optimal. 

Next,  the  models  used  for  the  localization  simulations  were  replaced  with  more  realistic 
models.  First,  the  robot  dynamics  were  allowed  to  become  continuous  with  normally  distributed 
noise  that  varies  with  the  control  magnitude.  This  model  was  then  systematically  tessellated  for  use 
with  Markov  localization.  The  resulting  transition  function  was  a  complex  function  that  incurs  high 
computation  costs.  To  address  this,  the  transition  function  was  relaxed  to  consider  only  the  most 
likely  transitions,  and  a  new  update  computation  was  used  in  place  of  the  standard  Markov  move¬ 
ment  update.  It  was  shown  that  the  new  computation  reduces  the  time  complexity  of  the  movement 
update  from  O(nf)  to  something  less  than  0(ni).  Next,  the  robot  prior  was  relaxed  to  be  continuous 
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and  normally  distributed.  This  model  was  easily  tessellated  for  use  in  the  simulations.  Finally,  the 
sensor  model  was  relaxed  to  have  a  circular  detection  region.  Again,  during  the  tessellation  of  this 
model,  a  simple  approximation  was  made  for  the  sake  of  computation  time.  With  the  new  models 
and  their  computationally  efficient  counterparts,  we  again  ran  several  localization  simulations  using 
the  same  path  schemes  as  before  (xy,  yx,  straight,  node  centers,  manual)  with  the  addition  of  one: 
a  hybrid  scheme.  The  hybrid  path  scheme  was  developed  to  use  both  the  node  centers  path  and 
the  manual  path  in  order  to  quickly  reduce  and,  henceforth,  maintain  a  low  estimation  error.  This 
scheme  was  shown  to  outperform  the  other  schemes. 

With  verification  that  information  based  paths  perform  well  for  realistic  models,  we  then 
moved  on  to  developing  automatic  path  planners  in  Chapter  7.  In  this  chapter,  we  presented  a 
brief  introduction  to  entropy,  information,  and  interpretation  of  information  maps.  Using  this  back¬ 
ground,  3  automatic  path  planners  were  developed:  node  centers,  information  grid,  and  information 
clusters.  The  node  centers  planner  embodied  the  concept  of  having  a  robot  navigate  node  by  node 
to  the  destination.  Hence,  this  planner  made  use  of  the  node  topology.  The  information  grid  and 
information  clusters  planners  additionally  incorporated  knowledge  of  the  information  topology.  In 
particular,  the  information  clusters  planner  was  designed  to  mimic  the  manual  paths  previous  used. 
Next,  all  the  path  planners  and  various  configurations  of  them  were  tested  using  a  closed-loop,  mul¬ 
tiple  way -point  simulation.  It  was  shown  that  the  node  centers  path  planner  has  performance  similar 
to  that  of  the  simple  path  planners  (xy,  yx,  straight).  Whereas,  the  performance  of  information  based 
planners  outperformed  the  other  planners  by  providing  reduced  localization  error  and  path  devia¬ 
tion.  We  observed  that,  the  information  grid  path  planner  provided  a  short  path  to  the  destination 
and  low  bandwidth  utilization  and  the  information  clusters  planner  provided  the  best  localization 
accuracy  and  path  following  abilities. 

Overall,  our  work  provided  insight  into  real  world  SNAC  system  deployments  with  an 
emphasis  on  way-point  navigation.  We  deployed  one  of  the  first  and  largest  SNAC  systems.  We 
identified  the  challenges  for  such  systems  and  developed  a  control  and  estimation  solution  in  the 
form  of  a  service  based  architecture.  We  developed  a  practical  WSN  model  based  on  real  world 
experience  and  previous  results  from  the  literature.  We  combined  our  architecture  and  our  WSN 
model  into  an  application  level  simulator  suitable  for  studying  control  problems.  We  demonstrated 
the  benefits  of  several  of  the  services  from  our  architecture.  Then,  with  a  focus  on  the  intelligent 
path  planning  service,  we  generated  information  based  path  planners  and  demonstrated  their  ability 
to  reduce  localization  error  and  path  deviation. 
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Appendix  A 

Sensor  Network  and  Control  System 
Simulator  Parameter  Values 


This  section  provides  a  list  of  the  parameter  values  used  by  the  models  in  the  sensor 
network  and  control  system  simulator  discussed  in  Chapter  4. 

•  Physical  layout 

-  Field  dimensions:  20  m  by  20  m 

-  Number  of  motes:  100 

-  Mote  distribution:  uniform 

•  Sensor  model 

-  Dipole  height  off  ground:  dm  =  2  m 

-  Dipole  strength:  M  =  2x10  ^  =  10 1‘1’"  A/m  allowing  Bo(p  =  0)  =  10^4  T  =  1  G 

-  Permeability  of  free  space:  po  =  An  x  10  7  Wb/(Am) 

-  Saturation  threshold:  Btines  -  980  mG 

-  Saturation  measurement:  Bsat  =  980  mG 

-  Saturation  duration:  Tsat  =  2  s 

-  Normal  sensors:  1  -  p„  -  pu,  -  pup  -  94  % 

*  Bias  distribution:  Af(0, 202)  mG 

*  Noise  variance  distribution:  N(§,  202)  mG 
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-  Noisy  sensors:  pn  =  2% 

*  Bias  distribution:  N(0, 202)  mG 

*  Noise  variance  distribution:  N( 0, 1002)  mG 

-  Unpredictable  sensors:  pup  =  2  % 

*  Measurement  distribution:  N(pup,cr^ip) 

*  Distribution  standard  deviation:  crup  =  50  mG 

*  Distribution  mean:  pup  =  -crup  V2erf  ](2pd,,ect  -  1)  +  Bthres  where  p detect  =  0.05 
is  the  probability  of  the  measurement  exceeding  the  detection  threshold. 

-  Unresponsive  sensors:  pur  =  2  % 

•  Communication  model 

-  Packet  drop  probability:  pdrop  =  0-30 

-  Single  hop  lag  distribution:  U[0, 0.3]  s 

-  Reception  probability  {d  in  meters)  :  Pdist(d)  ~  11(2.4 1 4. 1) 

-  Average  hop  distance:  d/wp  =  2.414  m 

•  Mote  platform 

-  Low  energy  probability:  pe  =  2% 

-  Mote  expiration  time  distribution:  'ZYfO,  30]  s 

-  Mote  initial  sensing  time  distribution:  rU\0. 1.5]  s 

•  Mote  algorithm  parameters 

-  Reporting  threshold:  Breport  =  136  mG  where  the  sensor  will  nominally  detect  at  2  V2 
meters 

-  Reporting  period:  Treport  =  0.3  s 

•  Car  model 

-  Wheelbase:  b  =  0.15  m 

-  Wheel  speed  bounds:  v  e  [-0.25,0.25]  m/s 

-  Turning  angle  bounds:  (t>  £  [-20, 20]  degrees  providing  a  0.52  m  turning  radius 


-  Wheel  speed  error  variance:  cr 1  =  0.01 

-  Steering  angle  error  variance:  o ^  =  0.01 

Feedback  controller  parameters 


-  Wheel  speed  bound  estimate:  vmax  =  0.3  m/s 

-  Proportional  constant:  =  1 


Estimation  parameters 


-  p(n\v)  =  < 


0 


if  n  <  1  , 


1  otherwise. 

-  Expected  packet  delay:  Td  =  1  s 

-  Wheel  speed  bounds  estimate:  [vmin,vmax]  =  [-0.3, 0.3]  m/s 

-  Steering  angle  bounds  estimate:  I <Pmax\  =  [-0.4, 0.4]  radians 


Simulator  parameters 


-  Sample  time:  T  =  0.03  s 

lT 


-  Initial  state: 


xq  9q 


5  5  3tt/2 


-  Initial  state  estimate: 


Xq  % 


-  Destination  position:  x/  = 


4  4  nil 

-\T 


10  10 
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Appendix  B 

Node  Topologies  for  Closed-Loop 
Simulations 


This  section  provides  a  depiction,  in  Figure  B.l,  of  the  node  topologies  and  information 
maps  used  during  the  multiple  map,  multiple  way  point,  closed-loop  simulations  of  Section  7.3. 
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(a)  map  1  (b)  map  2  (c)  map  3 


(d)  map  4 


50  100 

(g)  map  7 


(f)  map  6 


50  100 

(h)  map  8 


(i)  map  9 


(j)  map  10 


Figure  B.  1 :  The  10  sensor  network  topologies  and  accompanying  information  maps  used  during  the 
closed-loop  simulations. 


